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(54) Cloning and expression of microbial phytase 

(57) A nucleotide sequence encoding phytase has 
been isolated and cloned. The coding sequence has 
been inserted into an expression construct which in turn 
has been inserted into a vector capable of transforming 
a microbial expression host. The transformed microbial 
hosts may be used to econonnically produce phytase on 
an industrial scale. The compositions comprising 
phytase produced via the present invention may be 
used in a variety of processes requiring the conversion 
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Description 

The present invention relates to the microbial production of phytase. 

5 Background of the Invention 

Phosphorus is an essential element lor the growth of all organisms. In livestock production, feed must be supple- 
mented with inorganic phosphorus in order to obtain a good growth performance of monogastric aninfials (e.g. pigs, 
poultry and fish). 

10 In contrast, no inorganic phosphate needs to be added to the feedstuffs of ruminant animals. Microorganisms, 
present in the rumen, produce enzymes which catalyze the conversion of phytate (myo-inositolhexakisi3hosphate) to 
inositol and inorganic phosphate. 

Phytate occurs as a storage phosphorus source in virtually all feed substances originating from plants (for a review 
see: Phytic acid, chemistry and applications . E. Graf (ed.), Pilatus Press; Minneapolis, MN, U.S.A. (1986)). Phytate 

15 comprises 1 -3% of all nuts, cereals, legumes, oil seeds, spores and pollen. Complex salts of phytic acid are termed phy- 
tin. Phytic acid is considered to be an anti-nutritional factor since it chelates minerals such as calcium, zinc, magnesium, 
iron and may also react with proteins, thereby decreasing the bioavailability of protein and nutritionally important min- 
erals. 

Phytate phosphorus passes through the gastro-intestinal tiBCt of monogastric animals and is excreted in the 

20 manure. Though some hydrolysis of phytate does occur in tiie colon, the thus-released inorganic phosphorus has no 
nutritional value since inorganic phosphorus Is absorbed only in tiie small intestine. As a consequence, a significant 
amount of the nutritionally important phosphorus is not used by monogasti-ic animals, despite its presence in the feed. 

The exaetion of phytate phosphorus in manure has further consequences. Intensive livestock production has 
increased enormously during the past decades. Consequentiy, tiie amount of manure produced has inaeased corre- 

25 spondingly and has caused environmental problems in various parts of the world. This is due, in part, to the accunfiula- 
tion of phosphate from nr^nure in surface waters which has caused eutrophication. 

The enzymes produced by microorganisms, that catalyze tiie conversion of phytate to inositol and Inorganic phos- 
phorus are broadly known as phytases. Phytase producing microorganisms comprise bacteria such as Bacillus subtilis 
(V.K. Paver and V.J. Jagannattian (1982) J. Bacleriol. 151 1 102-1 108) and Pseudonomas (DJ. Ck)sgrove (1970) Aus- 

30 tral. J. Biol. Sci. 2^ 1207-1220); yeasts such as Saocharomvces cerevisiae (N.R. Nayini and R Markakis (1984) Leb- 
ensmittel Wissenschaft und Technologie 17, 24-26); and fungi such as Aspergillus terreus (K. Yamada, Y. Minoda and 
S. Yamamoto (1986) Agric. Biol. Chem. 32, 1275-1282). Various other Asperoillus species are known to produce 
phytase, of which, the phytase produced by Asperoillus ficuum has been determined to possess one of tiie highest lev- 
els of specific activity as well as having better tiiermostability than phytases produced by other microorganisms (unpub- 

35 lished observations). 

The concept of adding microbial phytase to the feedstuffs of monogastric animals has been previously described 
(Ware, J.H., Bluff, L and Shieh, T.R. (1967) U.S. Patent No. 3,297.548; Nelson. TS., Shieh, TR., Wodzinski, R.J. and 
Ware, J.H. (1 971 ) J. Nutrition 101 1 289-1 294). To date, however, application of tiiis concept has not been commercially 
feasible, due to tiie high cost of flie production of the microbial enzymes (YW. Han (1989) Animal Feed Sd. & Technol. 

40 24. 345-350). For economic reasons, inorganic phosphorus is still added to monogastric animal feedstuffs. 

Microbial phytases have found other industrial uses as well. Exemplary of such utilities is an industrial process for 
the production of starch from cereals such as corn and wheat. Waste products comprising e.g. corn gluten feeds from 
such a wet milling process are sold as animal feed. During tiie steeping process phytase may be supplemented. Con- 
ditions (T « 50*^0 and pH = 5.5) are ideal for fungal phytases (see e.g. European Patent Application 0 321 004 to AIko 

45 Ltd.). Advantageously, animal feeds derived from the waste products of this process will contain phosphate instead of 
phytate. 

It has also been conceived that phytases may be used in soy processing (see Finase^" Enzymes By AIko. a product 
infomiation brochure published by AIko Ltd., Rajamaki, Finland) . Soybean meal contains high levels of the anti-nutri- 
tional factor phytate which renders this protein source unsuitable for application in baby food and feed for fish, calves 
50 and other non-ruminants. Enzymatic upgrading of this valuable protein source Improves the nutritional and commercial 
valued this material. 

Other researchers have become interested in better characterizing various phytases and improving procedures for 
the production and use of these phytases. Ullah has published a procedure for tiie purification of phytase from wild-type 
Aspergillus ficuum. as well as having determined several biochemical parameters of the product obtained by this puri- 
55 fication procedure (Ullah, A. (1988a) Preparative Biochem. 1£, 443-458). Pertinent data obtained by Ullah is presented 
in Table 1, below. 

The amino add sequence of the N-terminus of tiie A. ficuum phytase protein has twice been disclosed by Ullah: 
Ullah, A. (1987) Enzyme and Engineering conference IX, October 4-8, 1987. Santa Barbara, California (poster presen- 
tation); and Ullah, A. (1988b) Prep Biochem. Ifi, 459-471. The amino acid sequence data obtained by Ullah Is repro- 
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duced in Figure 1A, sequence E, below. 

Several interesting observations may be made from the disclosures of Ullah. First of all. the "purified" preparation 
described in Ullah (1988a and 1988b} consists of two protein bands on SDS-PAGE. We have found, however, that 
phytase purified from A. f icuum contains a contaminant and that one of the bands found on SDS-PAGE. identified by 

5 Ullah as a phytase, is originating from this contaminant. 

This difference is also apparent from the amino acid sequencing data published by Ullah (1987, 1988b; compare 
Figure 1 A, sequences A and B with sequence C). We have determined, in fact, that one of the amino acid sequences 
of internal peptides of phytase described by Ullah (see Figure 1 B. sequence E) actually belongs to the contaminating 
100 kDa protein (Figure 1C) which is present in the preparation obtained via the procedure as described by Ullah, and 

10 seen as one of the two bands on SDS-PAGE (Ullah. 1 988a and 1 988b). Ullah does not recognize the presence of such 
a contaminating protein, and instead identifies it as another form of phytase. The presence of such contamination, in 
turn, increases the difficulty in selecting and isolating the actual nucleotide sequence encoding phytase activity. Fur- 
thermore, the presence of the contamination lowers the specific activity value of the protein tested. 

Further regarding the sequence published by Ullah. it should be noted that the amino acid residue at position 12, 

15 has been disclosed by Ullah to be glydne. We have consistentiy found using protein and DNA sequencing techniques, 
that this residue is not a glycine but is in lact a cysteine (see Figures 6 and 8). 

Finally, Ullah discloses that phytase is an 85 kDa protein, with a molecular weight after deglycosylation of 61 .7 kDa 
(Ullah, 1988b). This number, which is much lower than the earlier reported 76 kDa protein (Ullah, A. and Gibson, D. 
(1988) Prep. Biochem. 17(1). 63-91) was based on the relative amount of carbohydrates released by hydrolysis, and 

20 the apparent molecular weight of the native protein on SDS-PAGE. We have found, however, that glycosylated phytase 
has a single apparent molecular weight of 85 kDa, while the deglycosylated protein has an apparent molecular weight 
in the range of 48 - 56.5 kOa. depending on the degree of deglycosylation. 

Mullaney si at. (Filamentous Fungi Conference, April, 1 987, Pacific Grove. California (poster presentation) also dis- 
close the characterization of phytase from A. ficuum . However, this report also contains mention of two protein bands 

25 on SDS-PAGE, one of 85 kDa, and one of 1 00 kDa, which were present in the "purified" protein preparation. These pro- 
tein bands are both identified by the authors as being forms of phytase. A method for transforming microbial hosts is 
proposed, but has not been reported. The cloning and isolation of the DNA sequence encoding phytase has not been 
desaibed. 

It will be appreciated that an economical procedure for the production of phytase will be of significant benefit to. 
30 inter alia, the animal feed industry. One method of producing a more economical phytase would be to use recombinant 
DNA techniques to raise expression levels of the enzyme in various microorganisms known to produce high levels of 
expressed peptides or proteins. To date, however, the isolation and cloning of the DNA sequence encoding phytase 
activity has not been published. 

35 Summarv of the Invention 

The present invention provides a purified and isolated DNA sequence coding for phytase. The isolation and cloning 
of this phytase encoding DNA sequence has been achieved via the use of specific oligonucleotide probes which were 
developed especially for the present invention. Preferred DNA sequences encoding phytases are obtainable from fun- 
40 gal sources, especially filamentous fungi of the genus Aspergillus. 

It is another object of the present invention to provide a vector containing an expression construct which further 
contains at least one copy of at least one, preferably homologous DNA sequence encoding phytase. operably linked to 
an appropriate regulatory region capable of directing the high level expression of peptides or proteins having phytase 
activity in a suitable expression host. 
45 The expression construct provided by the present invention may be inserted into a vector, preferably a plasmid. 
which is capable of transforming a microbial host cell and integrating into the genome. 

It is a further object of the present invention to provide a transformant, preferably, a microbial host which has been 
transformed by a vector as desaibed in the preceding paragraph. The transformed hosts provided by the present inven- 
tion are filamentous fungi of the genera Aspergillus. Trichoderma. Mucor and Penicillium. yeasts of the genera Kluy- 
50 veromvces and Saccharomvces or bacteria of the genus Bacillus . Especially preferred expression hosts are 
filamentous fungi of the genus Aspergillus. The transformed hosts are capable of producing high levels of recombinant 
phytase on an economical, industrial scale. 

In otiier aspects, the invention is directed to recombinant peptides and proteins having phytase activity in glyco- 
sylated or unglycosylated form; to a metiiod for the production of said unglycosylated peptides and proteins; to peptides 
55 and proteins having phytase activity which are free of impurities; and to monoclonal antibodies reactive with these 
recombinant or purified proteins. 

A comparison of tiie biochemical parameters of tiie purified wild-type A. ficuum phytase as obtained by Ullah, 
against the further purified wild-type A. ficuum phytase, obtained via tiie present invention, is found in Table 1 , below. 
Of particular note is the specific activity data wherein it is shown that the purified protein which we have obtained has 
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twice the specific activity of that which was published by Ullah. 

The present invention further provides nucleotide sequences encoding proteins exhibiting phytase activity, as well 
as amino acid sequences of these proteins. The sequences provided may be used to design oligonucleotide probes 
which may in turn be used in hybridization saeening studies for the identification of phytase genes from other species. 
5 especially microbial species, which may be subsequently isolated and doned. 

The sequences provided by the present invention may also be used as starting materials for the construction of 
"second generation" phytases. "Second generation" phytases are phytases. altered by mutagenesis techniques (e.g. 
site-directed mutagenesis), which have properties that differ from those of wild-type phytases or recombinant phytases 
such as those produced by the present invention. For example, the temperature or pH optimum, specific activity or sub- 
10 strate affinity may be altered so as to be better suited for application in a defined process. 

Within the context of the present invention, the term phytase embraces a family of enzymes which catalyze reac- 
tions involving the removal of inorganic phosphorous from various myoinositol phosphates. 

Phytase activity may be measured via a number of assays, the choice of which is not critical to the present inven- 
tion. For purposes of illustration, phytase activity may be determined by measuring the amount of enzyme which liber- 
15 ates inorganic phosphorous from 1 .5 mM sodium phytate at the rate of 1 ^mol/min at 37**C and at pH 5.50. 

It should be noted that the term "phytase" as recited throughout the text of this specification is intended to encom- 
pass all peptides and proteins having phytase activity This point is illustrated in Figure 1 A which compares sequences 
A and B (sequences which have been obtained during the course of the present work) with sequence C (published by 
Ullah, 1 988b). The Figure demonstrates that proteins may be obtained via the present invention which lack the first four 
20 amino acids (the protein of sequence A lacks the first seven amino acids) of the mature A. ficuum phytase protein. 
These proteins, however, retain phytase activity. The complete amino add sequence of the phytase protein, as deduced 
from the corresponding nucleotide sequence, is shown in Figure 8. 

Phytases produced via the present invention may be applied to a variety of processes which require the conversion 
of phytate to inositol and inorganic phosphate. 
25 For example, the production of phytases according to the present invention will reduce production costs of microbial 
phytases in order to allow its economical application in animal feed which eventually will lead to an in vivo price/perform- 
ance ratio confipetitive with inorganic phosphate. As a further benefit, the phosphorus content of manure will be consid- 
erably decreased. 

It will be appredated that the application of phytases, available at a price competitive with inorganic phosphate, will 
30 inaease the degrees of freedom for the compound feed industry to produce a high quality feed. For example, when feed 
is supplemented with phytase. the addition of inorganic phosphate may be omitted and the contents of various materials 
containing phytate may be increased. 

In addition to use in animal feeds and soy processing as discussed above, the phytase obtained via the present 
invention may also be used in diverse industrial applications such as: 

35 

- liquid feed for pigs and poultry It has become common practice to soak feed for several hours prior to feeding. Dur- 
ing this period the enzyme will be able to convert phytate to inositol and inorganic phosphate; 

- an industrial process for the production of inositol or inositol-phosphates from phytate; 

- other industrial processes using substrates that contain phytate such as the starch Industry and in fermentation 
40 industries, such as the brewing industry. Chelation of metal ions by phytate may cause these minerals to be una- 
vailable for the production microorganisms. Enzymatic hydrolysis of phytate prevents these problems. 

These and other objects and advantages of the present invention will become apparent from the following detailed 

description. 

45 

Brief description of the Figures 
Figure 1. 

so A. N-terminal amino acid sequences as determined for purified phytase. The amino acid sequences 

labeled A and B are provided by the present invention, and originate from the phytase subforms with 
isoelectric points of 5.2 and 5.4, respectively Sequence C is cited from Ullah (1987. 1988b, supra). 
The amino acid residue located at position 12 of sequences A and B has been determined by the 
present invention not to be a glycine residue. [* denotes no unambigous identification. denotes no 

55 residue detected.] 

B. N-terminal amino acid sequences of CNBr-cleaved internal phytase fragments. The amino acid 
sequences labeled A and B (apparent molecular weight approximately 2.5 kDa and 36 kDa peptides, 
respectively) are provided by the present invention. Sequences C through E are cited from Ullah 
(1988b, supra). 
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Figure 2. 

5 



C. N-terminal amino acid sequence of a 1 00 kDa protein which has been found by the present inven- 
tion to be present in crude phytase samples. 



A. Oligonucleotide probes designed on basis of the data from Figure 1 A, peptides A through E. 

B. Oligonucleotide probes designed on the basis of the data from Figure 1 B. peptides A and B. 



Figure 3. Oligonucleotide probes used for the isolation of the gene encoding the acid-phosphatase. 

10 

Figure 4. Restriction map of bacteriophage lambda AF201 containing the phytase locus of A. ficuum . The arrow 
indicates the position of the phytase gene and the direction of transcription. Clone # shows the subclones 
derived with indicated restriction enzymes from phage AF201 in pAN 8-1 (for pAF 28-1) and in pUCl9 
(for all other subclones). 

15 

Figures. Physical map of pAF M. The 10 kb BamHI fragment, inserted in pUCl9, contains tiie entire gene 
encoding acid phosphatase from A. ficuum . 

Figure 6. Compilation of the nucleotide sequences of plasmids pAF 2-3, pAF 2-6, and pAF 2-7 encompassing the 
20 chromosomal phytase gene locus. The phytase coding region is located from nucleotide position 210 to 

position 1 713; an intron is present in the chromosomal gene from nucleotide position 254 to position 355. 
Relevant features such as resfriction sites, the phytase start and stop codons. and the intron position are 
indicated. 

25 Figure 7. Detailed physical map of the sequenced phytase chromosomal locus; the arrows indicate the location of 
tiie two exons of the phytase coding region. 

Figure 8. Nucleotide sequence of the translated region of the phytase cDNA fragment and the derived amino add 
sequence of the phytase protein; tiie start of the mature phytase protein is indicated as position +1 . The 
30 amino-terminus of tiie 36 kDa internal protein fragment is located at amino add position 241 , whereas 

tiie 2.5 KDa protein fragment starts at amino add position 390. 

Figure 9. Physical map of tiie phytase expression cassette pAF 2-2S. Arrows indicate the direction of transcription 
of tiie genes. 

35 

Figure 10. lEF-PAGE evidence of the overexpression of phytase in an A. ficuum NRRL 3135 transfbrmant. Equal 
volumes of culture supematant of A. ficuum (lane 1) and transformant pAF 2-2S SP7 (lane 2), grown 
under identical conditions, were analysed on a Phast-System (Pharmacia) lEF-PAGE gel in the pH- 
range of 4.5-6. For comparison, a sample of A. ficuum phytase, purified to homogeneity was induded 
40 either separately (lane 4). or mixed with a culture supernatant (lane 3). The gels were either stained with 

a phosphatase stain described in the text (A), or with a general protein stain (Coomassie Brilliant Blue, 
B). The phytase bands are indicated by an asterisk. 

Figure 1 1 . lEF-PAGE evidence for tiie overexpression of phytase in A. nioer CBS 513.88 ti^ansformants. Equal vol- 
45 umes of culture supernatants of the A. niqer parent sti-ain (lane 1), or flie tofansfomiants pAF 2-25 #8 

(lane 2), pFYT3 #205 (lane 3) & #282 (lane 4) were analysed by lEF-PAGE as described in the legend 
of Fig. 10. The gels were eitiier stained by a general phosphatase activity stain (A) or by a general protein 
stain (B). Phytase bands are indicated by an asterisk 

50 Figure 12. Physical map of pAB 6-1. The 14.5 kb Hindlll DNA insert in pUC19 contains tiie entire glucoamylase 
(AG) locus from A. nkier. 

Figure 13. A schematic view of tiie generation of AG promoter/phytase gene fusfons by the polymerase chain reac- 
tion (PCR). The sequences of all oligonudeotide primers used are indicated in tiie text. 

55 

Figure 1 4. Physical map of tiie phytase ^ression cassette pAF 2-2SH. 

Figure 15. Physical maps of the intermediate constructs pXXFYTI, pXXFYT2 and the phytase expression cas- 
settes PXXFYT3, wherein XX indicates the leader sequence (L). In p18FYT# and p24FYT#, respectively 
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20 



25 



30 



35 



40 



the 1 8 aa and the 24 aa AG leader sequence are inserted whereas in pFYT#, the phytase leader is used. 
Figure 1 6. Physical map of plasmid pFYTS amdS. 
Physical map of plasmid pFYTSINT. 
Physical map of the phytase/AG replacement vector pREPFYT3. 



5 Figure 17. 
Figure 18. 
Figure 19. 

10 



Autoradiographs of chromosomal DNA, digested with Pvuli (A) and Baml-ll (B) and hybridized with the 
^^P-labeled A. f icuum phytase cDNA as probe of the microbial species S. cerevisiae (lane 2); B. subtilis 
(lane 3); K. lactis (lane 4); P. cryspgepum (lane 5); R aerggiposa (lane 6); g. lividang (lane 7); A. niqer 1 
lig (lane 8): A. nioer S ^g (lane 9); blank (lane 10): C. thermocellum (lane 11). Lane 1: marker DNA. 



Detailed Description of the Invention 

The cloning of the genes encoding selected proteins produced by a microorganism can be achieved in various 
ways. One method is by purification of the protein of interest, subsequent determination of its N-terminal amino acid 
sequence and screening of a genomic library of said microorganism using a DNA oligonucleotide probe based on said 
N-terminal amino acid sequence. Examples of the successful application of this procedure are the cloning of the Iso- 
penicillin N-synthetase gene from Cephalosporium acremonium (S.M. Samson etaj. (1985) Nature aifi. 191-194) and 
the isolation of the gene encoding the TAKA anfiylase for Aspergillus oryzae (Boel et ai. (1 986) EP-A-0238023). 

Using this procedure, an attempt has been made to isolate the Asperg illus ficuum gene encoding phytase. The pro- 
tein has been purified extensively, and several biochemical parameters have been determined. The data obtained have 
been compared to the data published by Ullah (1988a). Both sets of data are given in Table 1 , below. 



Table 1 



45 



Biochemical parameters of purified wild-type A. ficuum phytase 


Parameter 


Present invention 


Ullah 


Specific activity* 
Purity SDS-PAGE 
lEF-PAGE 
Km (Affinity constant) 


100 UAng protein 
85 kDa 
3or4bands 
250 mM 


50 UAng protein 
85/ 100 kDa 
not done 
40fiM 


Specificity for: 






lnositol-1-P 

lnositol-2-P 

pH optimum 

Temp, optimum (^'C) 

MW (kDa)** 

MW (unglycosylated)'* 

Isoelectric Point*** 


not active 

Km = 3.3mM 

2.5 and 5.5 

50 

85 

56.5 

5.0-5.4 


not active 
5% activity 
2.5 and 5.5 
58 

85 and 100 

61.7 

4.5 



so 



55 



* Phytase activity is measured t)y Ullah at 58°C rather than at 37<^C. A 
unit of phytase activity is defined as that amouni of enzyme which liber- 
ates inorganic phosphorus from 1 .5 mM sodium phytate at the rate of 1 
^md/min at 37°C and at pH 5.50. To compare the femientation yields 
and the specific activities, the activities disclosed l3y Ullah were con-eded 
for the temperature difference. The corection is based on the difference 
in phytase activity measured at Z^^O and at 58''C as shown in Table III of 
Ullah (1986b). 

** Apparent Molecular weight as determined by SDS-PAGE. 
*** As determined by lEF-PAGE 
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In order to isolate the gene encoding phytase, a first set of oligonucleotide probes was designed aocording to the 
above^Jescribed method (Fig. 2A). The design of these probes was based on the amino acid sequence data. As a con- 
trol for the entire procedure, similar steps were taken to isolate the gene encoding acid-phosphatase, thereby using the 
protein data published by Ullah and Cummins ((1987) Prep. Biochem 17, 397-422). For acid-phosphatase, the conre- 
5 sponding gene has been isolated without difficulties. Howey^er, for phytase. the situation appeared to be different. 
Despite many attempts in which probes derived from the N-terminal amino acid sequence were used, no genomic DNA 
fragments or clones from the genomic library could be isolated which could be positively Identified to encompass the 
gene encoding phytase. 

To overcome this problem, the purified phytase was subjected to CNBr-directed cleavage and the resulting protein 
10 fragments were isolated. The N-terminal amino acid sequences of these fragments were determined (Fig. 1 B), and new 
oligonucleotide probes were designed, based on the new data (Fig. 2B). Surprisingly the new oligonucleotide probes 
did identify specific DNA fragments and were suited to unambiguously identify clones from a genomic library No cross 
hybridization was observed between the new clones or DNA fragments isolated therefrom, and the first set of oligonu- 
cleotide probes or the clones isolated using the first set of probes. 
15 It will be appreciated that this second set of probes may also be used to identify the coding sequences of related 
phytases. 

The newly isolated clones were used as probes in Northern blot hybridizations. A discrete mRNA could only be 
detected when the mRNA was isolated from phytase producing mycelium. When RNA from non-phytase producing 
mycelium was attempted, no hybridization signal was found. The mRNA has a size of about 1800 b, theoretically yield- 
20 ing a protein having a maximal molecular weight of about 60 kDa. This value corresponds to the molecular weight which 
has been determined for the non-glycosylated protein, and the molecular weight of the protein as deduced from the 
DNA sequence. 

Moreover, when introduced into a fungal cell by transformation, an increase in phytase activity could be demon- 
strated. This indicates conclusively that the nucleotide sequence encoding phytase has indeed been isolated. The 

2S amino acid sequences which have been determined for the purified phytase enzyme, and for the CNBr fragments 
obtained therefrom, concur with the amino acid sequence deduced from the sequence which was determined for the 
cloned gene. The nucleotide sequence and the deduced amino acid sequence are given in Figures 6 and 8. and further 
illustrate the cloned sequence encoding phytase. 

The isolation of the nucleotide sequence encoding phytase enables the economical production of phytase on an 

30 industrial scale, via the application of modern recombinant DNA techniques such as gene amplification, the exchange 
of regulatory elements such as e.g. promoters, secretional signals, or combinations thereof. 

Accordingly, the present invention also comprises a transformed expression host capable of the efficient expression 
of high levels of peptides or proteins having phytase activity and. if desired, the efficient expression of acid phos- 
phatases as well. Expression hosts of interest are filamentous fungi selected from the genera Aspergillus. Trichoderma. 

35 Mucor and Penicillium. yeasts selected from the genera Kluyveromyces and Saccharomyces and bacteria of the genus 
Bacillus . Preferably, an expression host is selected which is capable of the efficient secretion of their endogenous pro- 
teins. 

Of particular interest are industrial strains of Aspergillus, especially nioer. ficuum. awamori or oryzae. Alternatively, 
Trichoderma reesei. Mucor miehei. Kluyveromyces lactis. Saccharomyces cerevisiae. Bacillus subtilis or Bacillus 
40 licheniformis may be used. 

The expression constmct will comprise the nucleotide sequences encoding the desired enzyme product to be 
expressed, usually having a signal sequence which is functional in the host and provides for secretion of the product 
peptide or protein. 

Various signal sequences may be used according to the present invention. A signal sequence which is homologous 
45 to the cloned nucleotide sequence to be expressed may be used. Alternatively, a signal sequence which is homologous 
or substantially homologous with the signal sequence of a gene at the target locus of the host may be used to facilitate 
homologous recombination. Furthermore, signal sequences which have been designed to provide for improved secre- 
tion from the selected expression host may also be used. For example, see Von Heyne (1983) Eur. J. Biochem. 133, 
17-21; and Perlman and Halverson (1983) J. Mol. Biol. 167, 391-409. The DNA sequence encoding the signal 
50 sequence may be joined directly through the sequence encoding the processing signal (cleavage recognition site) to 
the sequence encoding the desired protein, or through a short bridge, usually fewer than ten codons. 

Prefen-ed secretional signal sequences to be used within the scope of the present invention are the signal 
sequence homologous to the cloned nucleotide sequence to be expressed, the 18 amino acid glucoamylase (AG) sig- 
nal sequence and the 24 amino acid glucoamylase (AG) signal sequence, the latter two being either homologous or het- 
55 erologous to the nucleotide sequence to be expressed. 

The expression product, or nucleotide sequence of interest may be DNA which is homologous or heterologous to 
the expression host. 

"Homologous" DNA is herein defined as DNA originating from the same genus. For example, Aspergillus is trans- 
formed with DNA from Aspergillus. In this way it is possible to improve already existing properties of the fungal genus 
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without introducing new properties, which were not present In the genus before. 

"Heterologous" DNA is defined as DNA originating from more than one genus, i.e., as follows from the example 
given in the preceding paragraph. DNA originating from a genus other than Aspergillus, which is then expressed in 
Aspergijlgs. 

5 Nucleotide sequences encoding phytase activity are preferably obtained from a fungal source. More preferred are 
phytase encoding nucleotide sequences obtained from the genus Asperoillus. Most preferred sequences are obtained 
from the species Asperplllus ficuum or Aspergillus niqer. 

The region 5' to the open reading frame in the nucleotide sequence of interest will comprise the transcriptional ini- 
tiation regulatory region (or promoter). Any region functional in the host may be employed, including the promoter which 

10 is homologous to the phytase-encoding nucleotide sequence to be expressed. However, for the most part, the region 
which is employed will be homologous with the region of the target locus. This has the effect of substituting the expres- 
sion product of the target locus with the expression product of interest To the extent that the level of expression and 
secretion of the target locus encoded protein provides for efficient production, this transcription initiation regulatory 
region will normally be found to be satisfactory. However, in some instances, one may wish a higher level of transcription 

15 than the target locus gene or one may wish to have inducible expression employing a particular inducing agent. In those 
instances, a transcriptional initiation regulatory region will be employed which is different from the region in the target 
locus gene. A large number of transcriptional initiation regulatory regions are known which are functional in filamentous 
fungi. These regions include those from genes encoding glucoamyiase (AG), fungal amylase, acid phosphatase. 
GAPDH, TrpC. AmdS . AlcA. AjdA, histone H2A, Pyr4, PyrG. isopenicillin N synthetase, PGK, acid protease, acyl trans- 

20 ferase. and the like. 

The target locus will preferably encode a highly expressed protein gene, i.e.. a gene whose expression product is 
expressed to a concentration of at least about 0.1 g/l at the end of the fermentation process. The duration of this proc- 
ess may vary inter alia on the protein product desired. As an example of such a gene, the gene encoding glucoamyiase 
(AG) is illustrative. Other genes of interest include fungal a-amylase, acid phosphatase, protease, acid protease, lipase. 

25 phytase and cellobiohydrolase. Especially preferred target lod are the glucoamyiase gene of A. nicer, the fungal amy- 
lase gene of A. oryzae. the cellobiohydrolase genes of X reesei. the acid protease gene of Mucor miehei. the lactase 
gene of Kluweromvces lactis or the invertase gene of Saccharomvces cerevisiae . 

The transcriptional termination regulatory region may be from the gene of interest, the target locus, or any other 
convenient sequence. Where the construct includes further sequences of interest downstream (in the direction of tran- 

30 scription) from the gene of interest, the transcriptional termination regulatory region, if homologous with the target 
locus, should be substantially smaller than the homologous flanking region. 

A selection marker is usually employed, which may be part of the expression construct or separate from the expres- 
sion construct so that it may integrate at a site different from the gene of interest. Since the recombinant molecules of 
the invention are preferably transformed to a host strain that can be used for industrial production, selection markers to 

35 monitor the transformation are preferably dominant selection markers, i.e.. no mutations have to be introduced into the 
host strain to be able to use these selection markers. Examples of these are markers that enable transformants to grow 
on defined nutrient sources (e.g. the A. nidulans amd S gene enables A. nioer transformants to grow on acetamide as 
the sole nitrogen source) or markers that confer resistance to antibiotics (e.g., the bje gene confers resistance to phle- 
omycin or the hQh gene confers resistance to hygromycin B). 

40 The selection gene will have its own transcriptional and translational initiation and termination regulatory regions to 
allow for independent expression of the marker. A large number of transcriptional initiation regulatory regions are known 
as described previously and may be used in conjunction with the marker gene. Where antibiotic resistance is employed, 
the concentration of the antibiotic for selection will vary depending upon the antibiotic, generally ranging from about 30 
to 300 ^g/ml of the antibiotic. 

45 The various sequences may be joined in accordance with known techniques, such as restriction, joining comple- 
mentary restriction sites and ligating. blunt ending by filling in overhangs and blunt ligation, Bal31 resection, primer 
repair, in vitro mutagenesis, or the like. Polylinkers and adapters may be employed, when appropriate, and introduced 
or removed by known techniques to allow for ease of assembly of the expression construct. At each stage of the syn- 
thesis of the construct, the fragment may be cloned, analyzed by restriction enzyme, sequencing or hybridization, or the 

50 like. A large number of vectors are available for cloning and the particular choice is not critical to this invention. Normally, 
ctaning will occur in E. coli . 

The flanking regions may include at least part of the open reading frame of the target locus, particularly the signal 
sequence, the regulatory regions 5' and 3' of the gene of the target locus, or may extend beyond the regulatory regions. 
Normally, a flanking region will be at least 100 bp, usually at least 200 bp, and may be 500 bp or more. The flanking 
55 regions are selected, so as to disrupt the target gene and prevent its expression. This can be achieved by inserting the 
expression cassette (comprising the nucleotide sequence to be expressed and optionally including additional elements 
such as a signal sequence, a transcriptional initiation regulatory region sequence and/or a transcriptional termination 
regulatory region sequence) into the open reading frame proximal to the 5' region, by substituting all or a portion of the 
target gene with the expression construct, or by having the expression construct intervene between the transaiptional 
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initiation regulatory region at the target locus and the open reading frame. As already indicated, where the termination 
regulatory region is homologous with the region at the target locus, the 3'-f lanking region should be substantially larger 
than a termination regulatory region present in the construct. 

The present invention also provides the starting material for the construction of 'second-generation' phytases, i.e. 
5 phytase enzymes with properties that differ from those of the enzyme isolated herein. Second-generation phytases may 
have a changed temperature or pH optimum, a changed specific activity or affinity for its substrates, or any other 
changed quality that makes the enzyme more suited for application in a defined process. E. coil is the best host for such 
mutagenesis (e.g. site-directed mutagenesis). Since E. coli lacks the splicing machinery for the removal of introns which 
might be present in the phytase gene, a cDNA clone of phytase is the sequence of choice to be expressed in E. coli . 
10 This cDNA sequence can be readily mutated by procedures well known in the art, after which the mutated gene may 
be introduced into the desired expression constructs. 

The construct may be transformed into the host as the cloning vector, either linear or circular, or may be removed 
from the cloning vector as desired. The cloning vector is preferably a plasmid. The plasmid will usually be linearized 
within about 1 kbp of the gene of interest. Preferably, the expression construct fa the production of the phytases of the 
IS present invention will be integrated into the genome of the selected expression host. 

A variety of techniques exist for transformation of f Oamentous fungi. These techniques include protoplast fusion or 
transformation, electroporation and micro-projectile firing into cells. Protoplast transformation has been found to be suc- 
cessful and may be used with advantage. 

Mycelium of the fungal strain of interest is first converted to protoplasts by enzymatic digestion of the cell wall in the 
20 presence of an osmotic stabilizer such as KCI or sorbitol. DNA uptake by the protoplasts is aided by the addition of 
CaCl2 and a concentrated solution of polyethylene glycol, the latter substance causing aggregation of the protoplasts, 
by which process the transforming DNA is included in the aggregates and taken up by the protoplasts. Protoplasts are 
subsequently allowed to regenerate on solid medium, containing an osmotic stabilizer and, when appropriate, a selec- 
tive agent, for which the resistance is encoded by the transforming DNA. 
25 After selecting for transformants, the presence of the gene of interest may be determined in a variety of ways. By 
employing antibodies, where the expression product is heterologous to the host, one can detect the presence of expres- 
sion of the gene of interest Alternatively, one may use Southern or Northern blots to detect the presence of the inte- 
grated gene or its transcription product. 

Amplification of the nucleotide sequence or expression construct of interest may be achieved via standard tech- 
30 niques such as, the Introduction of multiple copies of the construct in the transforming vector or the use of the amdS 
gene as a selective marker (e.g. Weinans et a[. (1985) Current Genetics, 9, 361 -368). The DNA sequence to be ampli- 
fied may comprise DNA which is either homologous or heterologous to the expression host, as discussed above. 

The cells may then be grown in a convenient nutrient medium. Low concentrations of a protease inhibitor may be 
employed, such as phenyimethylsulfonyl fluoride, a2-macro-globulins. pepstatin, or the like. Usually the concentration 
35 will be in the range of about 1 ^g/ml to 1 mg/ml. The protease gene(s) may be inactivated in order to avoid or reduce 
degradation of the desired protein. 

The transformants may be grown in either batch or continuous fermentation reactors, where the nutrient medium is 
isolated and the desired product extracted. 

Various methods for purifying the product, if necessary, may be employed, such as chromatography (e.g., HPLC), 
40 solvent-solvent extraction, electrophoresis, combinations thereof, or the like. 

The present invention also provides a downstream processing method in which the fermentation broth (optionally 
purified) is filtered, followed by a second germ-free f IKraton, after which the filtered solution is concentrated. The thus- 
obtained liquid concentrate may be used as follows: 

45 a) Phytase and other proteins may be precipitated from the liquid concentrate by adding acetone to a final volume 
of 60% (v/v) under continuous stirring. The precipitate may be dried in a vacuum at 35''C. After grinding the dry 
powder, the enzyme product may be used as such for application experiments. Recovery yields are about 90%. 
b) The liquid concentrate may be spray-dried using conventional spray-drying techniques. Recovery yields vary 
from 80 to 99%. 

50 c) The liquid concentrate may be mixed with cannier materials such as wheat bran. The thus-obtained mixture may 
be dried in a spray tower or in a fluid bed. 

d) The liquid concentrate may be osmotically stabilized by the addition of e.g. sorbitol. A preservative such as ben- 
zoic acid may be added to prevent microbial contamination. 

55 All four formulations may be sold to premix manufacturers, compound feed industries, other distributors and farm- 
ers. 

The examples herein are given by way of illustration and are in no way intended to limit the scope of the present 
invention. It will be obvious to those skilled in the art that the phytase gene of the invention can be used in heterologous 
hybridization experiments, directed to the isolation of phytase encoding genes from other microorganisms. 
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Example 1 

Fermentation of A. ficuum NRRL 3135 

5 Asoeraillus ficuum strain NRRL 3135 was obtained from the Northern Region Research Lab, USDA, 1815 North 
University Street. Peoria, Illinois, USA. Fungal spore preparations were made following standard techniques. 

Spores and subsequently cells were transferred through a series of batch fermentations in Erlenmeyer flasks to a 
10 1 fermentor. After growth in batch culture, the contents of this fermentor were used as inoculum for a final 500 liter 
batch fermentation. 

10 The media used contains: 91 g/l corn starch (BDH Chemicals Ltd.): 38 g/l glucose.HaO; 0,6 g/l MgS04.7H20; 0.6 
g/l KCt; 0.2 g/l FeS04.7H20 and 12 g/l KNO3. The pH was maintained at 4.6 ± 0.3 by automatic titration with either 4N 
NaOH or 4N H2SO4. 

Cells were grown at 28''C at an automatically controlled dissolved oxygen concentration of 25% air saturation. 
Phytase production reached a maximum level of 5-10 U/ml after 10 days of fermentation. 

IS 

Purification and characterization of A. ficuum phvtase 

20 A. Phvtase activity assay 

100 ^1 of broth filtrate (diluted when necessary) or supernatant orlOO ^1 of demiwater as reference are added to 
an incubation mixture having the following composition: 

25 - 0.25 M sodium acetate buffer pH 5.5, or 

- glycine HCL-buffer pH 2.5 

- 1 mM phytic acid, sodium salt 

- demiwater up to 900 pi 

30 The resulting mixture is incubated for 30 minutes at ZTC. The reaction is stopped by the addition of 1 ml of 10% 
TCA (trichloroacetic acid). After the reaction has terminated, 2 ml of reagent (3.66 g of FeS04 • 7H2O in 50 ml of ammo- 
nium molybdate solution (2.5 g (NH4)6Mo7024 * 4H2O and 8 ml H2SO4, diluted up to 250 ml with demiwater)) is added. 

The intensity of the blue color is measured spectro-photometrically at 750 nm. The measurements are indicative of 
the quantity of phosphate released in relation to a calibration curve of phosphate in the range of 0 - 1 mMol/1. 

35 

Phosphatase stain 

Components with phospatase activity were detected by isoelectric focusing using a general phosphatase stain. 
The gel was incubated with a solution of a-naphthylphosphate and Fast Garnet GBC salt (Sigma, 0.1 & 0.2% (wA^), 
40 respectively) in 0.6 M sodium acetate buffer pH 5.5. The reaction, which results in the appearance of a black precipitate, 
was either terminated with methanohacetic acid (30:10 %, v/v). or, should the protein having phytase activity be further 
required, by rinsing with distilled water. 

B. Purification of A. ficuum ohvtase 

45 

Phytase was purified to homogeneity from the culture broth of A. ficuum NRRL 3135. The broth was first made 
germ-free by filtration. The resulting culture filtrate was subsequently further concentrated in a Filtron ultrafiltration unit 
with 30 kD cutoff filters. The pH and ionic strength of the sample were adjusted for the purification procedure by washing 
the sample with 10 mM sodium acetate buffer pH 4.5. The final concentration in this ultrafiltration procedure was 
50 approximately 20 Ibid. 

The sample was then applied to a cation exchanger (S-Sepharose Fast-Flow in a HR 16/10 20 ml column, both 
obtained from Pharmacia) in a Waters Preparative 650 Advanced Protein Purification System. The proteins bound were 
eluted with a sodium chloride gradient from 0-1 M in the sodium acetate buffer. Phytase eluted at approximately 250 
mM NaCL Phytase activity containing fractions were pooled, concentrated and desalted by ultrafiltration. The resulting 
55 solution was applied to an anion exchanger (Q-Sepharose Fast-Flow in a HR 16/10 20 ml column, Pharmacia), and the 
proteins were again eluted by a sodium chloride gradient from 0-1 M in the acetate buffer described above. Phytase was 
eluted from this column at approximately 200 mM NaCI. 

The result of these purification steps is a partially purified phytase preparation with a specific activity of approxi- 
mately 40-50 U/mg protein, indicating a 25-fbld purification. 
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Analysis of the purity of the partially purified phytase indicated the presence of a major impurity with a molecular 
weight of approximately 100 kDa (Fig. 1B. sequence E). Isoelectric focusing indicated the presence of a number of 
phosphatase activity containing enzymes, including 3-4 phytase subfbrms (isoelectric points varying from 5.0-5.4) (Fig 
1 A, sequences A and B). 

5 In order to obtain a homogeneous phytase preparation, a further two-fold purification was achieved by a subse- 
quent separation of the components of the partially purified phytase by isoelectric focusing in a LKB Multiphor system 
on Amphoiine PAG plates (pH range 4-6.5). The proteins with phosphatase activity (including the phytase) were 
detected by the general phosphatase staining procedure described above. The bands of interest were subsequently 
excised from the gel and the active protein was eluted by a 16 hr incubation of the gel slices in 1 0 mM sodium acetate 

10 buffer 5.5. The protein fractions were analysed in the specific phytase activity assay, as described in Example 2, thus 
discriminating the phytase fractions from other acid phosphatases. The final purification factor for phytase was approx- 
imately 60 fold (specific activity of final preparation 100 U/mg protein). In this final purification step it was also possible 
to isolate different subforms of phytase (Fig. 1 A, sequences A and B). 

Monoclonal antibodies directed against the A. ficuum phytase were prepared, providing an effective purification 

15 procedure. The antibody was coupled to cyanogen bromide-activated Sepharose 4B (5 mg/ml gel), and this matrix was 
used in a immunoaffinity column. The matrix was shown to bind approximately 1 mg phytase per ml. The phytase could 
be eluted from the affinity column with a pH 2.5 buffer (1 00 mM glydne-HCI, 500 mM NaCI) without any loss of activity. 
This procedure can be used to isolate homogeneous phytase from a crude culture filtrate in one single step with an 80% 
recovery and a 60-f6ld purification. 

20 

C. Dealvcosvlation of phytase 

A. ficuum phytase (70 ^g protein) was incubated with 2.5 U N-Gycanase (Genzyme) in 0.2 M sodium phosphate 
buffer pH 8.6 and 10 mM 1,10-phenanthroline in a total volume of 30 ^. 
25 After 16 hrs at 37°C, the extent of deglycosylation was checked by electrophoresis (Phast System, Pharmacia). 
The apparent molecular weight of the phytase was found to decrease from 85 kDa to approximately 56.5 kDa. The peri- 
odic acid Schiff (PAS) sugar staining, which identifies native phytase as a glycoprotein, failed to detect any residual car- 
bohydrates attached to the protein. The complete removal of carbohydrate was further substantiated by the sensitive 
lectin-blotting method. Native and deglycosylated phytase (both 1.5 \lq) were run on a standard SDS-PAGE gel and 
30 electrophoretically transferred to a PVDF membrane (Immobilon, Millipore) in 25 mM TRIS-glycine buffer pH 8.3, 20% 
{visf) methanol, for a period of 16 hrs at 30V. 

The membrane was subsequently incubated with 1% (w/v) bovine serum albumin in phospate buffered saline and 
incubated with concanavalin A-peroxidase (Sigma, 10 pg/ml in phosphate buffered saline). The peroxidase was then 
stained with 4-chloro-l -naphthol (Sigma). 
35 This sensitive method also failed to detect any residual carbohydrate attached to the deglycosylated phytase. 
After deglycosylation, phytase has completely lost its activity, possibly due to aggregation of the enzyme. 

Example 3 

40 Determination of the amino add sequence of phytase and design of oligonucleotide probes 

A. Determination of the N-terminal amino acid sequence 

Phytase was electrophoretically transferred from SDS-PAGE or from lEF-PAQE onto a PVDF blotting membrane 
45 (Immobilon, Millipore). Electroblotting was performed in 10 mM CAPS (3-cyc!ohexylamino-propanesulfbnic add) buffer 
pH 1 1 .0, with 10% (vA^) methanol, for a period of 16 hrs. at 30V and 4''C. 

The protein was located with Coomassie Brilliant Blue staining. The band of interest was excised, further destained 
in methanol and subjected to gas-phase sequendng. The procedure has been carried out several times, using several 
individual preparations. The results obtained are given in Figure 1 A (sequences A and B). 
50 The amino acid sequence has also been determined for a 100 kDa protein that was present in crude preparations. 
The data obtained for this protein are given in Figure 1C. This sequence shows considerable homology with the add 
phosphatase that has been isolated from Asperg illus njgsc (MacRae fit ai. (1988) Gene Zl» 339-348). 

B. Determination of internal amino add sequences 

55 

Protein fragmentation by cyanogen bromide 

Phytase, purified to homogeneity, was transferred into 100 mM NaHCOs by ultrafiltration (Microconcentrator Cen- 
tricon 30, Amicon). The protein was subsequently lyophilized, dissolved in 70% trifluoroacetic acid (v/v), and incubated 
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for 6 hr with an approximately 300-fold molar excess of CNBr. The reaction was terminated by dilution of the mixture 
with water. The resulting fragments were again lyophilized. The sample was then dissolved in SDS-PAGE sample buffer 
containing DTT (dithiothreitol), and the extent of fragmentation was determined by PAGE. Analytical PAGE was per- 
formed on a Pharmacia Phast-System unit, on 20% SDS-PAGE gels. The gels were prerun to aeate a continuous 
5 buffer system to improve the separation of the small peptides (according to the manual). Peptides were detected using 
a silver-staining technique known in the art. since Coomassie Brilliant Blue failed to detect the smallest peptide. The 
result of the procedure was a complete degradation of phytase into peptides with molecular weights of < 2.5 kDa, 36 
kDa, 57 kDa and 80 kDa. 

The peptides were isolated for gas-phase sequencing by SDS-Tricine-PAGE as described by Schagger & Jagow 
10 (1987) Anal. Biochem. Ififi, 368-379 followed by electroblotting as described above. 

The N-terminus of the 57 kDa fragment is identical to the N-terminus of phytase as determined by Ullah (1988b, 
supra), with the exception of the first four amino acids which are absent (Figure 1A, sequence B). The N-terminal 
sequences of the 2.5 kDa and 36 kDa peptides are shown in Rgure 1 B as sequences A and B. 

75 C. Oligonucleotide probes 

Oligonucleotide probes have been designed, based on the amino acid sequences given in Figure 1A and IB, and 
were prepared using an Applied Biosystems ABI 380B DNA synthesizer. These oligonucleotides are given in Figure 2A 
and 2B. 

20 

Example 4 

Hybridization of genomic blots and ge nomic libraries with a first set of olioonucleotide probes 

25 Genomic DNA from A. f icuum has been isolated by grinding the mycelium in liquid nitrogen, using standard proce- 
dures (e.g. Yelton fil al (1 984) Proc. Natl. Acad. Sci.-U.S. A., 1 470-1 474). A genomic library was constructed in the bac- 
teriophage vector lambda EMBL3. using a partial Sau3A digest of AJ^mII NRRL 3135 chromosomal DNA, according 
to standard techniques (e.g. Maniatis st al- (1982) Molecular clo ning, a labor atory manual. Cold Spring Harbor Labora- 
tory, New York). The thus-obtained genomic library contained 60 to 70 times the A. f icuum genome. The library was 

30 checked for the occurrence of plaques without insert by hybrkJization with the lambda EMBL3 stuffer fragment. Less 
than 1% of the plaques were observed to hybridize to the lambda EMBL3 probe. The insert size was 1 3 to 1 7 kb. 

To identify conditions and probes that were suited for the screening of the genomic library, genomic DNA was 
digested with several restriction enzymes, separated on agarose gels and blotted onto Genescreen plus, using the 
manufacturers instructions. The blots were hybridized with all oligonucleotide probes. Hybridization was performed 

35 usings conditions of varying stringency (6 x SSC, 40 to eO'^C for the hybridization; up to 0.2 x SSC, 65**C for the wash- 
ing). Probes 1068 and 1024 (Figure 2A) were selected for the screening of the genomic library, although no common 
DNA fragments could be identified that hybridized specifically with both probes. Acid-phosphatase probe 1025 (Figure 
3) gave a specific and discrete hybridization signal and hence this probe was selected for screening the genomic library 
for the acid phosphatase gene. 

40 Using all three probes, hybridizing plaques could be identified in the genomic library. The hybridization signal cor- 
responding to probe 1025 (acid phosphatase) was strong and reproducible. Hybridization signals of variable intensity 
were observed using probes 1024 and 1068 (phytase). No cross hybridization between the two series was observed. 
All three series of plaques were rescreened and DNA was isolated from eight single, positive hybridizing plaques (Mani- 
atis et al.. supra). In each series, clones that contained identical hybridizing fragments coukJ be identified, indicating 

45 that the inserts of said clones are related and probably overlap the same genomic DNA region. Again, no cross-hybrid- 
ization could be demonstrated using the two phytase specific series (probes 1024 and 1068), indicating that, although 
both probes used to isolate the two series of clones were obtained from the N-terminal amino acid sequence of the pro- 
tein, different genomic DNA fragments had been identified and cloned. 

All three series of clones were hybridized with Northem blots containing mRNA isolated from induced and non- 

50 induced mycelium (Example 6). The add phosphatase-specrf ic clones, as well as the isolated internal 3. 1 kb Sail frag- 
ment from these clones, hybridized exclusively to induced mRNA samples. The mRNA identified by the acid phos- 
phatase-specific probes is about 1800 b in length, which agrees with the known size of the protein (68 kDa, Ullah and 
Cummins (1987) Prep. Biochem 1Z 397-422). No hybrkJization of the phytase-specific clones with specific mRNA*s 
could be demonstrated. We have thus concluded that the above-described method was unsuccessful in cloning the 

55 gene encoding phytase. It may be further concluded that this failure is not due to a failure in the method used, since the 
method has been successfully applied to identify the gene encoding acid phosphatase. The lambda clone containing 
the acid phosphatase gene was deposited on April 24, 1989 at the Centraal Bureau voor Schimmelcultures, Baarn, The 
Netherlands and has been assigned accession number CBS 214.89. A 10 kb BamHI fragment has been isolated from 
phage Z1 and subcloned into pUC19. This subclone contains the entire gene encoding acM phosphatase. The sub- 



EP0 779037A1 



clone, pAF M (Rgure was deposited on April 24. 1989 as CBS 213.89. 
Example 5 

5 Isolation of the oene encoding phvtase. usino a second set of oliaonucleotide probes 

Probes have been designed using the N-terminal amino acid sequence of CNBr-generated fragments (Figure 28. 
probes 1 295, 1 296 and 1 297) and have been hybridized with genomic DNA as described above. The feasibility of using 
these probes in the isolation of the gene encoding phytase was again studied by Southern hybridization of genomic 

10 blots with the probes. This time, hybridizing fragments of corresponding lengths could be identified, using all three 
probes, despite the fact that the probes have been derived from non-overlapping regions. No hybridization was found 
between the new set of probes and the clones that have been isolated using the first set of probes (Example 4). There- 
fore, the genomic library was rescreened using all three probes in separate experiments. A subset of the clones 
(lambda AF201 ,219, 241 and 243) isolated with each individual probe also hybridized with both other probes, indicating 

15 that in this case, using Ihe three different probes, clones were isolated from a single genomic region. Attempts were 
made to hybridize the newly isolated clones with probes 1024 and 1068. In both cases, no hybridization with the newly 
isolated clones was observed under conditions in which both probes had successfully hybridized to the clones which 
were isolated using these probes (see Example 4). This demonstrates that the newly isolated clones have no homology 
to the probes derived from the N-terminus of the purified phytase. 

20 A lambda EMBL3-clone. which hybridizes to all three probes (1295-1297), was named lambda AF201 (Figure 4) 
and was deposited on March 9, 1989 as CBS 155.89. 

A 5.1 kb Bam HI fragment of lambda AF201 (subcloned in pUC19 and designated pAF 2-3. see Figure 4), hybrid- 
izing to all three oligonucleotide probes, was used to probe a Northern blot. In this case, a discrete mRNA having a size 
of 1800 bases was identified. This mRNA was found only in induced mycelium. Similar results were obtained when the 

25 oligonucleotides were used as probes. Therefore, using the new set of probes, a common DNA fragment has been 
identified, which hybridizes specifically to an induced mRNA. The length of this mRNA (1800 b) is sufficient to encode 
a protein of about 60 kDa, which is about the size of the non-glycosylated protein. Qearly, the isolated fragments con- 
tain at least part of the gene encoding phytase. 

30 Exsjmpleg 

Isolation of "induced" and "non-induced" mRNA 

It is known from the literature that the synthesis of phytase by A. ficuum is subject to a stringent phosphate-depend- 
35 ent regulation (Han and Callagher (1987) J. Indust. Microbiol. 1, 295-301). Therefore, the demonstration that an iso- 
lated gene is subject to a similar regulation can be considered to support the evidence that the gene of interest has 
been cloned. 

In order to isolate mRNA that has been synthesized under both producing and non-producing conditions. A. ficuum 
NRRL 3135 was grown as follows. Spores were first grown overnight in non-inducing medium. The next day. the myc- 

40 elium was harvested, washed with sterile water and inoculated into either inducing or non-inducing medium. The 
medium used contains (per liter): 20 g corn starch; 7.5 g glucose: 0.5 g MgS04 • 7 H2O; 0.2 g FeS04 • 7 H2O; and 7.2 
g KNO3. For the induction of phytase, up to 2 g/l corn steep liquor was added to the medium, while non-inducing 
medium contains 2 g/l K2HPO4. The mycelium was grown for at least a further 100 hours. Samples were taken at 
selected intervals. Phytase production was followed by the phytase assay as described in Example 2A. Denatured 

45 mRNA was separated by electrophoresis and blotted onto Genescreen plus. The blots were hybridized with ^^P- 
labelled pAF 2-3 or with the isolated 3. 1 kb Sail fragment from pAF 1 -1 (acid phosphatase) from Example 4. The results 
are shown in Table 2. 

Positive hybridization of the phytase specific 5.1 kb Bam HI fragment and the acid phosphatase specific 3.1 kb Sail 
fragment with isolated mRNA is observed only when cells are grown under conditions which are known to induce the 
so synthesis of phytase and acid phosphatases. From these results it has been concluded that the isolated genes are reg- 
ulated as expected for phytase and acid phosphatases. 



55 
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Table 2 

5 Hybridization of Northern blots using the phytase- 

specific 5.1 kb BamH I fragment (A) or the acid phosphatase 
specific 3.1kb Sai l fragment (B) as a probe; a + indicates 

,0 the presence of the 1800 b phytase mRNA or the 1800 b acid 
phosphatase mRNA. The relative phytase activity was 
determined for the 24 hr. samples: induced cultures have 10 
times more phytase activity than non-induced cultures. 

15 

Time after Induced Non- induced 

inoculation 

20 

A 24 hours + 

B 24 hours + 



Example 7 

30 Evidence for the cloning of the ohvtase gene 

To obtain definitive proof for the successful isolation of the gene encoding phytase, and to study the feasibility of 
increasing the expression of the cloned gene, the phytase gene was subcloned into a suitable vector and transformed 
to A. niger 402 (ATCC 9092). To this end, the phytase gene was isolated from the lambda clone AF201 as a 10 kb Nrul 

35 fragment and cloned into the Shil site of the vector pAN 8-1 (Matlern, I.E. and Punt RJ. (1 988) Fungal Genetics News- 
letter 35, 25) which contains the bje gene (conferring resistance to phleomycin) as a selection marker. The resulting 
construct was named pAF 28-1 (Rgure 4) and was transformed to A. niqer 402 according to the procedure as described 
in Example 9, with the exception that the protoplasts were plated on Aspergillus minimal medium supplemented with 30 
^g phleomycin/ml and solidified with 0.75% agar. Single transformants were purified and isolated and were tested for 

40 production in shake flasks, as described in Examples 1 and 2. As controls, transformants possessing only the vector, 
as well as the untransformed host were also tested (Table 3). Only A. nicer 402 containing pAF 28-1 appeared to pro- 
duce a phytase that reacted with a specific monoclonal antibody directed against A. f icuum phytase. The phytase react- 
ing with this monoclonal antibody could be eluted from an immuno affinity column at pH 2.5 and was shown to be 
identical in molecular weight, degree of glycosylation. isoelectric point and specific activity to the A. f icuum phytase. 

45 This finding provides clear evidence that A. nioer 402 cells transformed with pAF 28-1 express a phytase that is virtually 
identical to the A. f icuum phytase. Similar expression was not observed in either type of control cells. 



Table 3 



Strain 


Phytase/Activity U/ml 


% of phytase-activity 






adsorbed onto the immu- 






noaffinity column 


A. niqer 402 


0.5 


0 


A. niqer 402 pAF 28-1 


0.7 


10 


A. niqer 402 pAN 8-1 


0.5 


0 



Strains were grown under induced conditions (Example 6). Samples were taken after 96 hours of growth. 
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Example 8 

Characterizatio n of the ohvtase gene. 

5 The lambda clones containing the phytase gene have been analyzed by digestion with various restriction enzymes. 
A map of the genomic region encompassing the phytase gene is given in Figure 4. Defined restriction fragments have 
been subdoned in the cloning vector pUC19, as indicated in Figure 4. 

It has previously been shown (Example 5) that the 5.1 kb Bam HI fragment present in pAF 2-3 encompasses at least 
part of the phytase gene. Moreover the oligonucleotide probes 1295 and 1297 (Figure 2B) were shown to hybridize to 

10 the £ai I insert from pAF 2-7 (positions of pAF 2 clones are presented in Figure 4), while probe 1 296 probably spans the 
Sai l site between the fragments in pAF 2-6 and pAF 2-7. The results of these experiments indicate that the phytase 
encoding sequence is located in the lefthand part of the Bam HI insert of pAF 2-3. 

Subsequently the nucleotide sequences of the inserts of plasmids pAF 2-3, pAF 2-6. and pAF 2-7 have been deter- 
mined completely using the didecncy chain termination method (Sanger si SL- (1977) Proc. Natl. Acad. Sci. USA 7± 

IS 5463-5467) and shotgun strategies described by Messing etii- (1981, Nuci. Acids Res. 9, 309-321). In addition specific 
oligonucleotides were synthesized based on nucleotide sequence information obtained during the sequencing proce- 
dure. 

The complete nucleotide sequence of clones pAF 2-3, pAF 2-6, and pAF 2-7 encompassing the chromosomal 
phytase gene locus is compiled in Figure 6, a graphic representation is given in Figure 7. 

20 Analysis of the protein coding capacity of the complete sequence revealed that the N-termlnal amino add 
sequence of the mature protein was encoded starting from nucleotide position 381 (the N-terminus disclosed by Ullah 
is located at position 369). Furthermore, the N-terminal amino acid sequence of the 36 kDa and 2.5 kDa internal pep- 
tide fragments (see Figure 1 B - sequences B and A) were found to be encoded at nucleotide positions 1101 and 1548, 
respectively. The open reading frame stops at nucleotide position 1713. 

25 These findings clearly prove the identity of the characterized chromosomal locus as containing phytase encoding 
DNA sequence. 

Directly upstream of the chromosomal sequence encoding the nfiature phytase protein, no ATG start codon can be 
found within the reading frame contiguous with the mature protein open reading frame; however, using intron-exon 
boundary characteristics, an intron can be postulated between nucleotide positions 254 and 355. bringing the ATG 
30 codon at nucleotide position 210 in frame with the mature phytase encoding open reading frame. The derived amino 
acid sequence of this N-terminal extension closely fits the rules for a secretion signal sequence as published by von 
Heyne (1983, Eur. J. Biochem. 133. 17-21). 

To confirm these hypotheses the phytase cDNA was isolated by PCR-amplification with specific phytase primers 
and a total mRNA/cDNA population as template according to the procedures described below. 

35 

Isolation off doIv A=^ RNA from Aspergillus ficuum. 

Total RNA was isolated from A. ficuum NRRL 3135 grown under induced conditions as mentioned in Example 6. 
Dry mycelium was frozen wHh liquid nitrogen and ground. Subsequently, the powder was homogenized in an Ultra-Tur- 

40 rax (full speed during 1 minute) in 3M LiCI, 6M urea at O'^C and maintained overnight at 4°C as described by Auffrey & 
Rougeon (EurJ.Biochem., 107, 303-314,1980). Total cellular RNA was obtained after centrifugation at at 16,000 g for 
30 minutes and two successive extractions with phenol: chloroform: isoamylalcohol (50:48:2). The RNA was precipi- 
tated with ethanol and dissolved in 1 ml 1 0 mM Tris-HCI (pH 7.4), 0.5% SDS. For poly A"^ selection the total RNA sample 
was heated for 5 minutes at 60 °C, adjusted to 0.5 M Nad and subsequently applied to an oligo(dT]-cellulose column. 

45 After several washes with a solution containing 10 mM Tris-HQ pH 7.4, 0.5% SDS and 0.1 M NaCI, the poly A+ RNA 
was collected by elution with 10 mM Tris-HCI pH 7.4 and 0.5% SDS. 

Preparation of thg mRNA/gPNA opmplffl 

50 For the synthesis of the first cDNA strand 5 ^g of poly A"^ RNA was dissolved in 1 6.5 H2O and the following com- 
ponents were added: 2.5 ^1 RNasin (30 U/^i); 10 ^1 of a buffer containing 50 mM Tris-HCI pH 7.6, 6 mM MgCi2 and 40 
mM KCl; 2 ^1 1 M KCI; 5 ^ 0.1 M DTT; 0.5 ^il oligo (dT)i2.i8 (2.5 mg/ml); 5 8 mM dNTP-mix; 5 ^il BSA (1 mg/ml) and 
2.5 ^1 Moloney MLV reverse transcriptase (200 U/ml). The mixture was incubated for 30 minutes at 37 ''C and the reac- 
tion was stopped by addition of 1 0 0.2 M EDTA and 50 \i\ H2O. An extraction was performed with chloroform and after 

55 centrifugation 1 1 0 mJ 5 M NH4AC and 440 ^1 ethanol were successively added to the supernatant. Precipitation of the 
mRNA/cDNA complex was performed in a dry ice/ethanot solution for 30 minutes. The mRNA/cDNA was collected by 
centrifugation. subsequently washed with 70% ice-cold ethanol and dissolved in 20 ^1 H2O. 
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Cloning of phvtase cDNA fragments 

Isolation of the cDNA-encoding phytase sequences were performed by the polymerase chain reaction (PCR] in two 
fragments. Four synthetic oligonucleotide primers were designed based on the genomic phytase sequence as pre- 
5 sented in Figure 6. 

Oligo 1 : 5-QGG.TAG. AAT.TCA.AAA. ATG.GGC.GTC.TCTGCTGTT.CTA-3' 

Oligo 2: 5'-AGTGAC.GAA.TTC.GTG.CTG.GTG.GAQ.ATQ.QTQ.TCG-3' 

Oligo 3: 5'-GAG.CAC.CAA.GCT.GAA.GGA.TCC-3' 

10 Oligo 4: 5'-AAA.CTG.CAG.GCG.TTG.AGT.GTG.ATT.GTT.TAA.AGG.G-3' 

Oligo 1 contains the nucleotide sequence downstream of the phytase ATG startcodon (position 210 to 231) flanked 
at the 5' border by an EcoRI-site: oligo 2 contains the nucleotide sequence immediately upstream of the Sail-site (posi- 
tion 11 29 to 11 09) also flanked by an additional EcoRI-site: oligo 3 contains the nucleotide sequence around the Bam HI 

15 -site (position 845 to 865) and oligo 4 contains a nucleotide sequence positioned downstream of the phytase stopcodon 
(position 1890 to 1867) flanked by an additional Pstl-site. 

The polymerase chain reactions were performed according to the supplier of Taq-polymerase (Cetus). As template 
the solution (1.5 ^1) containing the mRNA/cDNA hybrids (described above) was used and as primers 0.3 ^ of each of 
the oligos 1 and 2 in the reaction to amplify the N-terminal phytase cDNA part and oligos 3 and 4 in the reaction to 

20 amplify the C-terminal phytase cDNA part (see Figure 8). After denaturation (7 minutes at lOO^'C) and addition of 2 U 
Taq-poiymerase the reaction mixtures were subjected to 25 amplification cycles (each: 2' at SS'C, 3* at 72*'C, 1 ' at 94''C) 
in a DNA-amplif ier of Perkin-Elmer/Cetus. In the last cycle the denaturation step was omitted. After digestion (EcoRI for 
the N-terminal cDNA part and Bam HI and £stl for the C-terminal cDNA part), both cDNA fragments were cloned into 
the appropiate sites of pTZ18R (Promega). 

25 The nucleotide sequence of both obtained PCR fragments was determined using the dideoxy chain termination 
technique (Sanger, supra) using synthetic oligonucleotides designed after the chromosomal phytase gene sequence, 
as primers arid total amplified DNA as well as cloned cDNA fragments as template. Hie sequence of the cDN A region 
encoding the phytase protein and the derived amino acid sequence of the phytase protein are depicted in Figure 8. 
The cDNA sequence confirmed the location of the intron postulated above, and indicated that no other introns were 

30 present within the chromosomal gene sequence. 

The phytase gene encodes a primary translation product of 467 amino acids (MW 51091); processing of the pri- 
mary translation product by cleaving off the signal peptide results in a mature phytase protein of 444 (MW 48851) or 
448 (containing the first four N-terminal amino acids as published by Ullah, MW 49232) amino acids. 

35 Example 9 

Qverexpression of phvtas e in Asperoilli bv introduction of additional phvtase genomic DNA copies 
Construction expression vector oAF 2-2S 

40 

All constructs were made using standard molecular biological procedures, as described by Maniatis et al., (1982) 
Molecular cloning, A laboratory Manual, Cold Spring Harbor Laboratory, N.Y. 

An expression vector pAF 2-28 was made by subcloning the 6 kb Pyull DNA fragment of the phytase genomic 
clone lambda AF201 , into the Sma l-site of pUC19. The derived plasmid was designated pAF 2-2 (Figure 4). As selec- 
45 tion marker for the transformation to Aspergillus, the EcoRI/l^l DNA fragment of plasmid pGW325 (Werners K. 
(1986), Thesis. Agriculture University. Wageningen, The Netherlands) containing the homologous Aspergillus nidulans 
amd S gene, was inserted into the EcoRl/Kpnl sites of pAF 2-2. The resulting expression vector was designated pAF 2- 
2S and is shown in Rgure 9. 

so A. Overexpression of ohytase in A. ficuum NRRL 3135. 

The plasmid pAF 2-25 was introduced in A. ficuum NRRL 3135 using transformation procedures as described by 
Tilbum, J. et.al.(1 983) Gene 26, 205-221 and Kelly. J. & Hynes. M. (1985) EMBO J., 4. 475-479 with the following mod- 
ifications: 

55 

- mycelium was grown on Aspergillus minimal medium (Cove, D. (1966) Biochem. Biophys. Acta. 113. 51-56) sup- 
plemented with 1 0 mM arginine and 10 mM proline for 1 6 hours at 30 **C in a rotary shaker at 300 rpm; 

- only Novozym 234 (NOVO Industri), and no helicase, was used for formation of protoplasts; 

- after 90 minutes of protoplast formation, 1 volume of STC buffer (1 .2 M sorbitol. 10 mM Tris-HCI pH 7.5, 50 mM 
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CaCl2) was added to the protoplast suspension and centrifuged at 2500 g at 4''C for 10 minutes in a swinging- 
bucket rotor. The protoplasts were washed and resuspended in STC-buffer at a concentration of 10® cells/ml 
- plasmid DNA was added in a volume of 10 jil in TE buffer (10 mM Tris-HCI pH 7.5. 0.1 mM EDTA) to 100 jil of the 
protoplast suspension; 

5 - after incubation of the DNA-protoplast suspension at 0°C for 25 minutes, 200 ^1 of PEG solution was added drop- 
wise (25% PEG 4000 (Merck), 10 mM Tris-HCI pH 7.5, 50 mM CaCl2). Subsequently, 1 ml off PEG solution ( 60% 
PEG 4000 in 10 mM Tris-HCI pH 7.5, 50 mM CaCl2) was added slowly, with repeated mixing of the tubes. After 
incubation at room temperature, the suspensions were diluted with STC-buffer, mixed by inversion and centrifuged 
at 2000 g at 4''C for 10 minutes. The protoplasts were resuspended gently in 200 pJ STC-buffer and plated on 

10 Aspergillus minimal medium with 1 0 mM acetamide as the sole nitrogen source, 15 mM CsCI, 1 M sucrose, solid- 
ified with 0.75% bacteriological agar #1 (Oxold). Growth was performed at SS^'C for 6-10 days. 

Single transfbrmants designated SP4, SP7 and SP8 were isolated, purified and tested for phytase production in 
shake flasks, usi ng the process as described in Examples 1 and 2. As a control, transformants possessing only the vec- 
75 tor (anxiS gene in pUC19), as well as the untransfonned host were tested. 

Strains were grown under induced conditions (see Example 6) and samples were taken after 96 hours of growth. 
Analyses were performed by measuring the phytase activity (Table 4} and by isoelectric focusing polyaaylamide gele- 
lectrophoresis (lEF-PAGE). 

Samples of equal volume were taken from fermentations of A. ficuum and A. ficuum pAF 2-2S SP7, grown under 
20 identical conditions, and were applied onto an lEF-PAGE gel (pH-range 4.5-6, Phast-System, Pharmacia). The electro- 
phoresis was performed according to the instructions of the manufacturer. Subsequently, the gels were either stained 
with the general protein stain Coomassie Briliant Blue (Figure 10B), or with the general phosphatase activity staining 
described in Example 2 (Figure 1 0A). 

A sample of A. ficuum phytase, purified to homogeneity (via immunoaffinity chromatography as described in Exam- 
25 pie 7), was also applied either alone, or mixed with a culture supernatant. 

Phytase is present in the various samples in a number of isoforms (indicated with an asterisl^, as has been men- 
tioned in this invention. The two major isoenzymes are clearly visible in the purified phytase in lanes 3 and 4 with both 
staining procedures (A and B). The phytase bands are barely visible in the parent A. ficuum strain, and significantly 
increased in the pAF 2-28 SP7 transfbrmant strain. 

30 



Table 4 



Increase off phytase production by transfformation of A^ 

fj£lMllNRRL3135. 


Strain 


Phytase activity (U/ml) 


A. ficuum 


0.6 


A. ficuum + control plasmid 


0.6 


A. ficuum pAF 2-2S SP8 


7.6 


A. ficuum dAF 2-2S SP7 


6.7 


A. ficuum pAF 2-2S SP4 


4.3 



45 



B. Qverexpresslon of phvtase in A.niaer CBS 513.88. 

50 The expression vector pAF 2-2S was also introduced in A. nioer CBS 513.88 by transformation procedures as 
described for A. ficuum . Single transfbrmants were isolated, purified and tested for phytase production in shake flasks 
under induced growth conditions as described in Example 6. 

Phytase expression levels of some transfbrmants (designated as A. niqer pAF 2-25 # 8. # 20 and # 33) and control 
strains were performed as described in Example 9A and are shown in Table 5. 

55 A. nioer transformants have phytase expression levels comparable with A. ficuum transformants. In addition this 
result indicates that the A. ficuum phytase promoter is active in A. niqer. 

Further analysis was performed on culture medium of transfformant pAF 2-25 #8 by electrophoresis on an lEF- 
PAGE gel in the pH range of 4.5-6 on a Phast-System (Pharmacia) as described above. Equal volumes of the culture 
supernatants of the A. niqer parent strain and of the transformant pAF 2-25 #8. grown under identk:al conditions, were 
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applied onto the gel. The gels were run and subsequently stained as above. 

The parent A. niaer produces a very low amount of phytase, which could not be detected by gel electrophoresis. 
The strain pAF 2-25 #8 produces approx. 90 times more phytase. and this difference is clearly visible in Figure 1 1 . 

Several isoforms of the phytase enzyme are detected (indicated by asterisk). The general protein stain indicates 
5 that the intensity of the phytase protein bands is dramatically increased, while no other major protein bands appear. 



Tables 



Phytase production bv transformation of A. niqer CBS 
513.88 with PAF2-2S 


Strain 


Phytase activity (U/ml) 


A. niqer 


0.2 


A. niqer + control olasmid 


0.2 


A. niqer pAF 2-2S#8 


14 


A. niqer pAF 2-2S#33 


5 


Aj!igfirpAF2-2S#20 


4 



20 



Exanrple 10 

25 

Phytase expression in A. niqer transfo rmed with expression vectors containing the A. f icuum ohvtase aene fused to the 
promoter and/or sional sequences of the A. niaer amyloqiucosidase (AG) gene. 

Constructions of the expression vectors. 

30 

To obtain overexpression of phytase in A. nicer, additional expression cassettes are derived in which the A. f icuum 
phytase gene is under control of the A. niqer amyloglucosidase (AG) promoter in combination with different signal 
sequences. In pi 8FYT3 and p24FYT3 the respective 1 8 and 24 amino acid (aa) leader sequences of the AG gene from 
A. niqer are fused to the phytase gene fragment encoding the mature protein. In the expression cassette pFYT3 the AG 
35 promoter sequence is fused to the phytase encoding sequence including the phytase leader sequence. 

Con$trMCtionQfp18FYT3 

Fusion of the AG-promoter and the 18 aa AG-leader sequence to the phytase sequence encoding the mature pro- 
40 tein were performed by the Polymerase Chain Reaction method. In the PCR reactions two different templates were 
used: pAF 2-25 containing the entire phytase gene as described above and pAB6-1. a plasmid which contains the 
entire AG-locus from A. nicer, which was isolated from a A. niqer plasmid library, containing 13-15 kb Hindlll fragments 
in pUC19. For the isolation, AG-specific oligos were used: 

45 AG-1 : 5'-GACAATGGCTACACCAGCACCGCAACGGAGATTGTTTGGCCC-3' 
AG-2: 5'-AAGCAGCCATTGCCCGAAGCCGAT-3' 

both based on the nucleotide sequence published for A. niaer (Boel ^ a!. (1984), EMBO J. 3, 1097-1 102; Boel fit al- 
(1984), Mol. and Ceil. Biol 4. 2306-2315). The oligonucleotide probes were derived from the sequence sun-ounding 
50 intron 2: oligo AG-1 is boated 3' of the intron and has a polarity identical to the AG mRNA and oligo AG-2 is found 
upstream of intron 2 and is chosen antiparallel to the AG mRNA. Plasmid pAB6-1 contains the AG gene on a 14.5 kb 
Hindlll fragment (see Figure 12). 

As primers for the PCR-amplrf ications lour synthetic oligonucleotides were designed with the following sequence: 

55 Oligo 1 : 5'-CTCTGCA GGAATTC AAGCTAG-3' (an AG-specific sequence around the Eco Rt site approx. 250 bp 
upstream the ATG initiation codon). 
Oligo 1 8-2: 5'-CGAGGCGGQGACTGCCAGTGCCAACCCTGTGCAGAC-3^ mature phytase (-1- > 1 8 AA AG-leader 
Oligo 18-3: 5'-GTCTGCACAGGGTTGGCACTGQCAGTCCCCGCCTCG-3' 18 aa AG-leader (-_L-> mature phytase 
Oligo 4: 5'-GGCACGAQQ^[QQTTCAGCTT-3' (a phytase specific sequence located at the Bam HI site on posi- 
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tion 861) 

The PCR was performed as described by Saiki el ai. (1988). Science 239, 487-491 , with minor modifications (see 
Example 8). 

To fuse the AG sequences to the phytase coding sequences two separate PCR's were carried out: the first reaction 
with pAB6-1 as template and oligos 1 and 18-2 as primers to ampHfy a 300 bp DNA fragment containing the 3'-part of 
the AG promoter and the 18 aa AG-leader sequence flanked at the 3'-border by the nucleotides of the phytase gene, 
and the second reaction with pAF 2-28 as template and oligos 18-3 and 4 as primers to amplify a 600 bp DNA fragment 
containing the 5' part of the phytase gene flanked at the 5'-border by 18 nucleotides of the AG signal peptide. A sche- 
matic view of these amplifications is presented in Figure 13. 

The two DNA fragments generated were purified by gelelectrophoresis and ethanol precipitation and used as tem- 
plates in the third PCR with oligos 1 and 4 as primers to generate the AG-phytase fusion. Hie obtained DNA fragment 
was digested with EcoRI and Bam HI and subcloned into pTZI 8R. The resulted fusion was sequenced and designated 
P18FYT1. 

The remaining (3.5 Kb) upstream region of the AGi)romoter was obtained by digestion of pAB6-1 with KQnl and 
partially with EcoRI and ligated to the 1.1 Kb EcoRI /Bam HI fragment of p18FYT1 and subsequently cloned into the 
Kpn I /Bam HI sites of pTZ18R. Plasmid p18FYT2 thus obtained is shown in Figure 15. 

An additional Hindi II restriction site was Introduced by insertion of the synthetic fragment: 

5' AATTCAAGCTTG 3' 
3' GTTCGAACTTAA 5' 

into the EcoRI-site (flanking the amdS -qene) of pAF 2-2S. Tlie obtained plasmid was designated pAF 2-2SH (Figure 

1 4) and is used as starting plasmid to exchange the phytase promoter sequences by the PCR AG-phytase fusion DNA 
fragments. 

For the final construction, p18FYT2 and pAF 2-2SH were digested with Kpnl and partially with Bam HI. The 4.6 kb 
DNA fragment of p18FYT2 and the 1 1 kb DNA fragment of pAF 2-2SH were isolated and purified by gel electrophore- 
sis, subsequently ligated and transferred to E. coli . Hie derived expression cassette was designated p18FYT3 (Figure 

15) . 

Construction of d24FYT3 

Fusion of the AG-promoter and the 24 aa AG leader sequence to the mature phytase encoding sequence was per- 
formed by PCR-ampiification as described above for the construction for p18FYT3 with the exception of the primers 
used. Two new primers were synthesized with the following sequence: 

Oligo 24-2: 5'-CGAGCCGGGGACTGCCAGGCGCTTGGAAATCACATT-3' mature phytase 24 AA AG-leader 
Oligo 24-3: 5'-AATGTGATTTCCAAGCGCCTGGCAGTCCCCGCCTCG-3* 24 aa AG-leader (-1- ) mature phytase 

Two separate PCR's were can-ied out: the first reaction with pAB 6-1 as template and oligos 1 and 24-2 as primers to 
amplify a 318 bp DNA fragment containing the 3'-part of the AG promoter and the 24 aa AG leader sequence flanked 
at the 3'-border by 18 nucleotides of the phytase gene and the second reaction with pAF 2-2S as template and oligos 
24-3 and 4 as primers to amplify a DNA fragment containing the 5'-part of the phytase gene flanked at the 5'-border by 
18 nucleotides of the 24 aa AG leader. A schematic view of these amplifications is presented in Figure 13. 

For the construction of the final expression cassette p24FYT3 via the intermediate plasmids p24FYT1 and 
p24FYT2, the same cloning pathway/procedure was used as described for p18FYT1 and p18FYT2 to derive the 
expression cassette p18FYT3 (Figure 15). 

Construction of DFYT3 

Fusion of the AG-promoter to the phytase gene (including the phytase leader) sequence was also performed by 
PCR-amplification as described above for the construction of p18FYT3 with the exception of the primers used. Two 
additional primers were generated with the following sequence: 

Oligo fyl-2: 5*-AACAGCAGIAGACGCCCATTGCTGAGGTGTAATGATG-3* phytase leader <-X- > AG-promoter 
Oligo fyl-3: 5'-CATCATTACACCTCAGCAATGGQCGTCTCTGCTGTT-3. AG-promoter <-l- ) phytase leader 

Two separate PCR's were carried out: the first reaction with pAB 6-1 as template and oligos 1 and fyt-2 as primers 
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to amplify a 282 bp DNA fragment containing the 3'-part of the AG promoter flanked at the 3'-border by 18 nucleotides 
of the phytase leader and the second reaction with pAF 2-2S as template and oligos fyt-3 and 4 as primers to amplify 
a DNA-fragment containing the 5'-part of the phytase gene (including the phytase leader) and flanked at the 5'-border 
by 18 nucleotides of the AG-promoter. A schematic view of these amplifications is presented in Figure 13. 
5 For the construction of the final expression cassette pFYT3 along the intermediate plasmids pFYTI and pFYT2, 
the same cloning pathwayyjsrocedure was used as described for p18FYT1 and p18FYT2 to derive the expression cas- 
sette pi 8FYT3 (Figure 15). 

Expression of the phvlase aene under the control of the AG promoter in A. nicer 

10 

E. coli sequences were removed from the phytase expression cassettes described above by hOndill digestion. 
Aftenwaixls, the A. niaer strain CBS 51 3.88 (deposited October 10,1 988) was transformed with 1 0 jig DNA fragment by 
procedures as desaibed in Example 9. Single A. nicer transfamants from each expression cassette were isolated, and 
spores were streaked on selective acetamide-agar plates. Spores of each transformant were collected from cells grown 
15 for 3 days at 37 ^'C on 0.4% potato-dextrose (Oxoid, England) agar plates. Phytase production was tested in shake 
flasks under the following growth conditions: 

Approximately 1 x 10^ spores were inoculated in 100 ml pre-culture medium containing (per liter): 1 g KH2PO4; 30 
g maltose; 5 g yeast-extract; 10 g casein-hydrolysate; 0.5 g MgS04 « 7H2O and 3 g Tween 80. The pH was adjusted to 
5.5. 

20 After growing overnight at 34 ""C in a rotary shaker, 1 ml of the growing culture was inoculated in a 100 ml main- 
culture containing (per liter): 2 g KH2PO4; 70 g malto-dextrin (Maldex MDO3, Amylum); 12.5 g yeast-extract; 25 g 
casein-hydrolysate; 2 g K2SO4; 0,5 g MgS04 • 7H2O; 0.03 g Zna2; 0.02 g CaCl2; 0.05 g MnS04 • 4 H2O and FeS04. 
The pH was adjusted to 5.6. 

The mycelium was grown for at least 1 40 hours. Phytase production was measured as described in Example 2. The 
25 production results of several, random transformants obtained from each expression cassette are shown in Table 6. 



30 



35 



40 



45 



so 



55 
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Tables 





Phytase production of several A. nioer CBS 513.88 strains transformed with plasmids containing the 


5 


A. f iQMMm phytase gene under control of the A. niqer AG-oronioter in conibination with different 




leader sequences. 






Expression cassette 


Transformant # 


Phytase activity (U/ml) 




p18FYT3 (AG-promoter/18 aa AG-leader) 


p18FYT3#240 


82 


10 




pl8FYT3#242 


84 






p18FYT3#243 


62 






p18FYT3#244 


43 


15 




p18FYT3#245 


80 






p18FYT3#246 


82 






p18FYT3#250 


110 




p24FYT3 (AG-promoter^4 aa AG-leader) 


p24FYT3#256 


8 


20 




p24FYT3#257 


30 






p24FYT3#258 


13 






p24FYT3#259 


33 


25 




p24FYT3#260 


17 






p24FYT3#261 


28 






p24FYT3#262 


18 


30 




p24FYT3#265 


12 


pFYTS (AG-promoter/phytase leader) 


pFYT3#205 


50 






pFYT3 # 282 


280 






pFYT3#299 


96 


35 




pFYT3#302 


220 






pFYT3#303 


175 






pFYT3#304 


150 


40 




pFYT3#305 


150 




pFYT3#312 


140 



45 The data clearly show high phytase expression levels in A. nioer translormants containing the phytase gene under 
the control of the A. niq er AG promoter. The data also show that the highest phytase production is obtained with the 
pFYT3 expression vector, which contains the phytase leader sequence. Similar ^ression vectors containing an intron- 
less phytase gene after transformation to A. nioer. resulted in phytase expression levels comparable to pFYT3 trans- 
fbrmants of A. nioer. 

so In addition, electrophoresis on an lEF-PAGE gel in the pH-range of 4.5-6 was performed on culture supematants 
of transformants pFYT3 #205 and #282. Equal volumes of the culture supematants of the A. nicer parent strain and of 
both transfornr^nts, grown under identical conditions, were applied onto the gel. run and sibsequently stained as 
described in Example 9. The parent A. nioer produces a very low anfiount of phytase, which is not detected in this exper- 
iment. The strains pFYT3 #205 and #282 produce approx. 250 and 1 400 times more phytase (compare phytase levels 

55 in Tables 4 and 5). and this difference is clearly visible in Figure 11. Several isoforms of the phytase enzyme are 
detected (indicated by an asterisk). The general protein stain indicates that the intensity of the phytase protein bands 
is dramatically increased, while no other major protein bands appear. 
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Example H 

Overexpression of phvtase in A. ticuum and A. niaer grown on an itxlustrial scale 
s A. A. ficuum 

Strain A. ficuum pAF 2-2S #4 and A. ficuum NRRL 3135 were grown as desaibed in Example 1 . The transfbrmant 
produced approximately 50 times more phytase as compared to the wild-type strain. 

«> Table 7 

Overexpression of phytase by a transformant of 
,5 A. ficuum containing multiple phytase genes. Cells were 
grown as described in Example 1. 

Hours after Phytase activity (U/ml Fermentation broth) 
^ inoculation A. ficuum NRRL 3135 A. ficuum pAF 

2-2S #4 



25 



0 
24 
92 
141 



0 
0 
2 
5 



0 
0 
142 
270 



30 

B. Ajiiger 

Strain A. nig er pAF 2-2S #8, a transformant of A. niq er strain CBS 513.88 and the parent A. niq er strain itself were 
35 grown as described in Example 1 . The transformant produced approximately 1 000 times more phytase as compared to 
the original A. niqer parent strain (Table 8). 

Table 8 

40 

Overexpression of phytase by a transformant of A. niaer 
(CBS 513.88) containing multiple phytase genes. Cells were 
grown as described in Example 1. 



Hours after Phytase activity (U/ml fermentation broth) 

inoculation A. niaer CBS 513.88 A. niaer pAF 2.2 #8 

50 

0 0 

0 5 

0.1 65 

0.1 95 



55 



0 
24 
92 
141 
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Example 12 

To contruct the vector pREPFYTS. with which simultaneously phytase expression and AG gene replacement is 
achieved, pFYT3 is digested with KdqI. With the obtained linear j^l DNA fragment two separate ligations are per- 
5 formed. 

Ligation 1 with the KpnI-Hindlli adaptor: 



5'- CGGGGA -3' 
3 ' - CATGG CCC CTTCGA - 5 ' 
Kpn l Hindi I I 



15 Ligation 2 with the Kpn I-Hindlll* adaptor, in which the Hindlll restriction site will not restore after ligation: 

5'- CGGGGG -3' 
3 ' - CTAGG CCC CCTCGA - 5 ' 
Kpn l Hindi! I* 



Subsequently, ligation 1 is partially digested with Hindlll. After removal of the MdS containing fragment by gel 
electrophoresis, the remaining DNA fragment is recircularized by ligation and transferred to E. coli . The obtained plas- 
25 mid is denoted pPtTS amdS (see Figure 16). 

Ligation 2 is also digested with Hindlll and the 4 kb DNA Hindlll/Hindlll* fragment, containing the amdS gene, is 
isolated by gel electrophoresis, subsequently ligated to a partially hyndlll digest of pFYT3 anrKiS and transferred to 
E.coli . The plasmid containing the amd S gene at the 3'end of the phytase gene is denoted pFYTSINT (see Figure 17). 
To introduce the approx. 6 kb Sall/Hindlll DNA fragment of pAB6-1, containing the 3'-flanking AG sequence, 
30 pFYT3INT is partially digested with HliKilll, ligated first to the adaptor: 

5' -AGCTAGGGGG -3' 

3 ' - TCCC CCAGCT - 5 ' 

Hindlll* Sai l 

35 

(in which the Hindlll* restriction site will not restore after ligation) and subsequently with the Sall/Hindlll fragment of 
pAB6-1. After transformation to E. coli. the desired plasmid pREPFYT3, containing the 3' AG flanking sequence at the 
correct position, is obtained (Figure 18). 

40 

Expression of ohvtase in A. nioer by AG gene replacement . 

Before transformation of A. niqer with pREPFYTS, the E. coli sequences in the plasmid are removed by Hindlll 
digestion and gelelectrophoresis. The A. nioer strain CBS 513.88 is transformed with 10 ^g DNA fragrnent by proce- 
ss dures as described in Example 9. Selection and growth of transformants is performed as described in Example 9. only 
a minority of the selected transformants lose AG activity (approx. 20%). Southern analysis of chromosal DNA is per- 
formed on AG negative and phytase positive fransformants to verify that the AG gene is indeed replaced by the phytase 
gene. 

50 Example 13 

Consen/ation of the phytase gene in different species. 

To determine whether the phytase gene is highly conserved within microbial species, Southern analyses of chro- 
55 mosomal DNA from ten different species were performed with the A. f icuum phytase cDNA as probe. 

These chromosomal DNA analyses were performed on species from filamentous fungi, yeasts and bacteria. As an 
example, only a limited number from each group were chosen: for filamentous fungi, Penicillium chrvsoaenum and 
Asperg illus niqer: for yeast, Saccharomyces cerevisiae and Kluyveromyces lactis; and for the procaryotic organisms the 
Grami3ositive species, Bacillus subtilis. Clostridum thermocellum. and Sfreptomyces Ibddaos and as an example for a 
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gram-negative bacterium Pseudomonas a eruginosa. 

High molecular weight chromosomal DNA from these species was digested with Pyull and BamH l separately and 
subsequently electrophorlzed on a 0.7% agarose gel. 

After transfer to nitrocellulose filters, the hybridization was performed overnight at low stringency (6 x SSC; SO'^C) 
5 with a ^^P-labeled 5'-phytase cDN A fragment (described in Example 8). Blots were washed in 6 x SSC at room temper- 
ature and exposed to X-ray for 18 hours. 

As shown in Figures 19 a and b, descrete bands are observed in almost every lane, predicting a high degree of 
homology of the phytase gene between microbial species. 

10 Claims 

1 . A composition comprising a fungal phytase which catalyses the liberation of at least one inorganic phosphate from 
a myoinositol phosphate, and wherein the phytase is encoded by a DNA sequence that hybridizes under conditions 
of low stringency (6 x SSC; SO^'C overnight) with a probe comprising nucleotide positions 1 - 818 of Figure 8. and 

15 characterized in that the composition is substantially free of an Aspergillus acid phosphatase with an apparent 
molecular weight on SDS-PAGE of 100 kDa and with at its N-terminus the amino acid sequence: Val Val Asp Glu 
Arg Phe Pro Tyr Thr Gly 

2. A composition according to daim 1 , further characterized in that the phytase has a pH optimum at pH 5.5. 

20 

3. A composition according to claims 1 or 2, further characterized in that the phytase has a second pH optimum at pH 
2.5. 

4. A composition according to any one of claims 1 - 3, further characterized in that the phytase has a specific activity 
25 of about 100 U/mg. 

5. A composition according to any one of claims 1 - 4, characterized in that the phytase is encoded by an Asperqillus 
Oigec or Aspergillus ficuum gene. 

30 6. A composition according to any one of claims 1 - 5, characterized in that the composition has a specific activity of 
100 U/mg. 

7. A process for the preparation of a feed for animals characterized in that a composition as defined in any one of 
claims 1 - 6 is used. 

35 

8. Use of the composition according to any one of claims 1 - 6 for the liberation of a least one inorganic phosphate 
from a myoinositol phosphate. 

9. A process for promoting the growth of animals characterized in that an animal is fed a diet which comprises a com- 
40 position according to any one of claims 1 - 6. 

10. A process tor the reduction of levels of phytate in animal manure characterized in that an animal is fed a diet which 
comprises a composition according to any one of claims 1 - 6, wherein the phytase is present in an amount effective 
in liberating inorganic phosphate from phytate contained in the feedstuff. 

45 



SO 



55 
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N-TERMINAL AMINOACID SEQUENCES 



Position 




B 




A 


C 


01 






LEU 


02 






ALA 


03 






VAL 


04 






PRO 


05 




ALA 


ALA 


06 




SER 


SER 


07 






ARG 


08 







ASN 


09 


GLN 


GLN 


GLN 


10 


SER 


SER 


SER 


11 


SER 


SER 


SER 


12 






GLY 


13 


ASP 


ASP 


ASP 


14 


THR 


THR 


THR 


15 


VAL 


VAL 


VAL 


16 


ASP 


ASP 


ASP 


17 


GLN 


GLN 




18 




GLY 




19 




TYR 




20 




GLN 




21 




ARG 




22 




PHE 




23 




SER 




24 




GLU 




25 




THR 




26 




SER 




27 




HIS 




28 




LEU 




29 




ARG 




30 




(GLY)* 




31 




GLN 




32 




TYR 




33 




ALA 




34 




PRO 




35 




PHE 




36 




PHE 




37 




(ASP) 




38 




LEU 




39 




ALA 





Figure la 
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PEPTIDE AMINOACID SEQUENCES 





A 

r\ 


□ 

D 


p 

V/ 


n 
u 


c 


rosition 












U1 




Vinr/ 


Mb 1 


Al A 

ALA 


VAL 


no 






MCT 


CCD 


VAL 


HQ 


vaLIN 


Due 

rnb 


/2I Kl 

VaLN 




A on 

ASP 


U4 


Al A 

ALA 


A OD 

Aor 




A 1 A 

ALA 




uo 


HA 1 1 
vaLU 


TUD 
1 nri 


fit M 

VaLIM 


^1 1 1 
VaLU 


A DO 

Anva 


UO 


ULIM 


II C 
ILC 


Al A 

ALA 


LYS 


PHc 


07 


vaLU 


CCD 

ben 


GLU 


QLY 


PRO 


HQ 

Uo 


rKU 


THR 


CaLN 


TYR 


TV/ 

TYR 


09 


LEU 


SER 


GLU 


ASP 


THR 


in 
lU 


\/ Al 

VAL 


THR 


PRO 


LEU 


QLY 


1 1 


VAKw 


VAL 


1 CI 1 

LcU 


\/ A 1 

VAL 




i O 
1^ 


\/ Al 

VAL 


A OD 

Aor 


VAL 


\ / A 1 

VAL 


A 1 A 

ALA 


13 


LEU 


THR 


ARG 






l4 


VAL 


1 \/r> 

LYS 


\/ A 1 

VAL 






15 


A OKI 

AoN 


LEU 


LEU 






16 


(ASP) 


SER 


VAL 






17 


(ARG) 


PRO 


ASN 






18 


(VAU 


PHE 


ASP 






19 


VAL 


(CYS) 


ARG 






20 


PRO 


(ASP) 








21 




LEU 








22 




PHE 








23 




THR 









Figure 1b 
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N-TERMINUS 100KD PROTEIN 

Position 

01 VAL 

02 VAL 

03 ASP 

04 GLU 

05 ARG 

06 PHE 

07 PRO 

08 TYR 

09 THR 

10 GLY 



Figure 1c 
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Figure 5 
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Figure 6 (sheet 1 of 7) 

• • • « • 

GTCGACTTCCCGTCCTATTCGGCCTCGTCCGCTGAAGATCCATCCCACCA 
Sail 

• • • ♦ • 

TTGCACGTGGGCCACCTTTGTGAGCTTCTAACCTGAACTGGTAGAGTATC 100 

• « • • • 

ACACACCATGCCAAGGTGGGATGAAGGGGTTATATGAGACCGTCCGGTCC 

• • • • ■ 

GGCGCGATGGCCGTAGCTGCCACTCGCTGCTGTGCAAGAAATTACTTCTC 200 

• • • • • 

ATAGGCATCATGGGCGTCTCTGCTGTTCTACTTCCTTTGTATCTCCTGTC 
translation start 

• • • • • 

TGG GTATGCTAAGCACCACAATCAAAGTCTAAT3AGGACCCTCCCTTCCG 300 

start< 

• • • • • 

AGGGCCCCTGAAGCTCGGACTGTGTGGGACTACTGATCGCTGACTATCTG 
— intron 

• « • • • 
TGCAG AGTCACCTCCGGACTGGCAGTCCCCGCCTCGAGAAATCAATCCAG 400 
->end 

TTGCGATACGGTCGATCAGGGGTATCAATGCTTCTCCGAGACTTCGCATC 

• • • • • 

TTTGGGGTCAATACGCACCGTTCTTCTCTCTGGCAAACGAATCGGTCATC 500 

TCCCCTGAGGTGCCCGCCGGATGCAGAGTCACTTTCGCTCAGGTCCTCTC 

• • ■ • • 

CCGTCATGGAGCGCGGTATCCGACCGACTCCAAGGGCAAGAAATACTCCG 600 

• • ■ * • 

CTCTCATTGAGGAGATCCAGCAGAACGCGACCACCTTTGACGGAAAATAT 

• « ■ • • 

GCCTTCCTGAAGACATACAACTACAGCTTGGGTGCAGATGACCTGACTCC 700 

• • • • • 

CTTCGGAGAACAGGAGCTAGTCAACTCCGGCATCAAGTTCTACCAGCGGT 

• • • • • 

ACGAATCGCTCACAAGGAACATCGTTCCATTCATCCGATCCTCTGGCTCC 800 

AGCCGCGTGATCGCCTCCGGCAAGAAATTCATCGAGGGCTTCCAGAGCAC 

• • • • • 

CAAGCTGAAG6ATCCTCGTGCCCAGCCCGGCCAATCGTCGCCCAAGATCG 900 
BaioHI 

• • • • • 

ACGTGGTCATTTCCGAGGCCAGCTCATCCAACAACACTCTCGACCCAGGC 

• • • • • 

ACCTGCACTGTCTTCGAAGACAGCGAATTGGCCGATACCGTCGAAGCCAA 1000 
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Figure 6 (sheet 2 of 7) 

• • • • • 

TTTCACCGCCACGTTCGTCCCCTCCATTCGTCAACGTCTGGAGAACGACC 

• • • • « 

TGTCCGGTGTGACTCTCACAGACACAGAAGTGACCTACCTCATGGACATG 1100 

• • • • • 

TGCTCCTTCGACACCATCTCCACCAGCACCGTCGACACCAAGCTGTCCCC 

sail 

• • • • • 

CTTCTGTGACCTGTTCACCCATGACGAATGGATCAACTACGACTACCTCC 1200 

• ■ • • • 

AGTCCTTGAAAAAGTATTACGGCCATGGTGCAGGTAACCCGCTCGGCCCG 

• • • • • 

ACCCAGGGCGTCGGCTACGCTAACGAGCTCATCGCCCGTCTGACCCACTC 1300 

• • • • « 

GCCTGTCCACGATGACACCAGTTCCAACCACACTTTGGACTCGAGCCCGG 

• • • • • 

CTACCTTTCCGCTCAACTCTACTCTCTACGCGGACTTTTCGCATGACAAC 1400 

GGCATCATCTCCATTCTCTTTGCTTTAGGTCTGTACAACGGCACTAAGCC 

GCTATCTACCACGACCGTGGAGAATATCACCCAGACAGATGGATTCTCGT 1500 

• • • • • 

CTGCTTGGACGGTTCCGTTTGCTTCGCGTTTGTACGTCGAGATGATGCAG 

• • • • • 

TGTCAGGCGGAGCAGGAGCCGCTGGTCCGTGTCTTGGTTAATGATCGCGT 1600 

• • • • • 

TGTCCCGCTGCATGGGTGTCCGGTTGATGCTTTGGGGAGATGTACCCGGG 

• • • • • 

ATAGCTTTGTGAGGGGGTTGAGCTTTGCTAGATCTGGGGGTGATTGGGCG 1700 

• • • • a 

GAGTGTTTTGCTTAGCTGAATTACCTTGATGAATGGTATGTATCACATTG 
translation stop 

C ATATCATTAGCACTTCAGGTATGTATTATCGAAGATGTATATCG AAAGG 1800 

ATCAATGGTGACTGTCACTGGTTATCTGAATATCCCTCTATACCTCGTCC 

• • • • • 

CACAACCAATCATCACCCTTTAAACAATC ACACTCAACGC ACAGCGTACA 1900 

• • • • * 

AACGAACAAACGCACAAAGAATATTTTACACTCCTCCCCAACGCAATACC 

• • • • a 

AACCGC AATTCATCATACCTCATATAAATACAATACAATAC AATACATCC 2000 
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Figure 6 (sheet 3 of 7) 

, • • • • 

ATCCCTACCCTCAAGTCCACCCATCCTATAATCAATCCCTACTTACTTAC 

• • • • • 

TTCTCCCCCTCCCCCTCACCCTTCCCAGAACTCACCCCCGAAGTAGTAAT 2100 

• • ■ • • 

AGTAGTAGTAGAAGAAGCAGACGACCTCTCCACCAATCTCTTCGGCCTCT 

« • • • • 

TATCCCCATACGCTACACAAAACCCCCACCCCGTTAGCATGCACTCAGAA 22 00 

« • • • • 

AATAATCAAAAATAACTAAGAAGGAAAAAAAAGAAGAAGAAAGGTTACAT 

• • • • • 

ACTCCTCTCATACAAACTCCAAGACGTATACATCAAGATGGGCAATCCCA 2300 

• • • • • 

CCATTACTGATATCCATCTATGAACCCATTCCCATCCCACGTTAGTTGAT 

■ • • • • 

TACTTTACTTAGAAGAAGAAAAAGGGAAGGG AAGGGAAAG AAGTGGATGG 2400 

, • • • • 

GATTGAGTTAGTGCTCACCGTCTCGCAGCAAGTTTATATTCTTTTGTTTG 

, • • • • 

GCGGATATCTTTCACTGCTCCTGCTGGACGTTGTCACGGGGTGGTAGTGG 2500 

• • • ♦ ' . 

TTGGCGGTGGTGAGGGTCCATGATCACTCTTGGTTTGGGGGGTTGTTGTT 

« ■ • • • 

GTCGTTGTTGTTGTTGTTGGGTGGGCATTTTCTTTTCTTCACTTGGGGAT 2600 

• • • • • , 

TATTATTTGGAATTGGTTAGTTTGAGTGAGTGGGTAATATTGAATGGGTG 

» • • • • 

ATTATTGGGAATGAAGTAGATTTGGCTATGAATGGTTGATGGGATGGAAT 2700 

• • • • • 

GAATGGATGGATGAATAGATGGAGGCGGAAAAGTCAGGTGGTTTGAGGTT 

• • . • ■ • 

CGGATTATTATCTTTGTGCCTG AGGCATCACTCTCCATCTATGTTGTTCT 2800 

, • • • • 

TTCTATACCGATCTACCAGAGCTAAGTTGACTGATTCTACCACAGTGCAC 

• • • • 

AATAAGTATGTACTTATTTC ATTTAGAGTATTT AG ATTAACCCGCTGTGC 2900 

, • • • • 

TATTTGCCGTAGCTTTCCACCCAATTTCGAAGTTCGAAGAATTAAAACTC 

, • • • • 

ATCCTACAGTACAGAATAGAAGTAAAAGGAGAAGAGAAAAACAAGATAAT 3000 
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Figure 6 (sheet 4 of 7) 

• • • • • 

ACAACCAGTCCAGGTCCATTCTAGATCTCGAATGACCACCAAATAAGAAA 

■ • • • • 

GCAACAAGCAAGTAAGCAAAGCATAAGTCTAAATGAACGCCAATAACTTC 3100 

• • • • • 

ATCGCCTGCCTTTGAAACTGAACGCTATGCACGAATGGCTCGAAATGATT 

• • • • • 

CCCTTAACTCC6TAGTATTGAGAGTGAGAGGAAAAGAAAAAAAGAGACA6 3200 

• • • • • 

AAAAGCTGACCATGGGAAAGAAGCATGATCAGTCGGGAATGGATCTGCGG 

• • • • • 

GTTGAGATAGATATGAGTTGCCTCGCAGATCCGGTGACAAGATAAGAGAA 3300 

• • • • • 

TTGGGAGATGTGATCAGCCACTGTAACTTCATCAAGCATCGACATTCAAC 

• • ■ • « 

GGTCGGGTCTGCGGGTTGAGATGCAAGTTGAGATGCCACGCAGACCCGAA 3400 

CAGAGTGAGAGATGTGAGACTTTTGAACCACTGTGACTTCATCAAGCATC 

AAAACACACTCCATGGTCAATCGGTTAGGGTGTGAGGGTTGATATGCCAG 3 500 

• • • • « 

GTTCGATGCCACGCAGACCCGAACCGACTGAGAAATATGAAAAGTTGGAC 

• • • • • 

AGCCACTTCATCTTCATCAAGCGTAAAACCCCAATCAATGGTAAATCGAA 3600 

• • • • • 

AACGAATCTGCGGGCTGATGTGGAAATGAGACGAATGCCTCGCAGATTCG 

AAGACACGTAAATCGAGATGAACAATCACTTTAACTTCATCAAAGCCTTA 3700 

• • ■ • • 

AATCACCCAATGGCCAGTCTATTCGGGTCTGCGGGTTGAGGTTCCTGTTG 

• • « • • 

AGATGCC ACGCAGACTGCGAACATGCGATGCATTATAAGTTGGACGAGTG 3 800 

• • • • • 

TAGACTGACCATTGATAACCGAGATAAACAATCACTTCAACTTCATCAAA 

• • • • • 

GCCTTAAATCACTCAATGGCCAGTCTGTTTGCGGTCTGCGGGCTGATACC 3900 

• • • • • 

CAAGTTGCGATGCCACGCAGACTGCAAACATTGATCGAGAGACGAGAAAA 

• • • • • 

ACAACGCACTTTAACTTCAACAAAAGCCTTTCAATCAGTCAATGGCCAGT 4000 
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Figure 6 (sheet 5 of 7) 

« • • • • 

CTGTTCGCGGTCTGCGGGCTGATATGCGAGTTGAGGTGCCTCGCAGACCG 

• ■ • • • 

CGAACATGCGATGTAATTTCTTAGTTAGACGAGTGCCTGGCCATTGAGAA 4100 

• * • • • 

ACGAGAGAAACAACCACTTTAACTTCATGAAAGCCTTGAACTACTCAATG 

• • • • > 

ACCCGTCTGTTGGCGGTCTGCGGGCTGAT ATTCGAGTTGAGATGCCACGC 4200 

• • • • * 

AGACCGCCAACATGCGATGTATCATGTAAGTTAGATGAGTGACTGGCCAT 

• • • • • 

TGAGAAACGAGAGAAACAACCACACTTCATGAGAGCCTTAAATTATTCAA 43 00 

■ • • • • 

TGACCAGTCTGTTCACGGTCTGCGGGTTGGTATGCGAGTCGAGGTGCCTC 

• • • • • 

GCAGACCGCGAACATGCGATGTTTTCGATGGACGAGTGAAGCCTGACGAT 4400 

• • • • • 

CGAGAACTATCTCAGTTGGGTTGGCCATTCGGCTGGCCGTTGGGTTTAGT 

• • • • • 

ATTAGGATCGTCAGGTTTGTCCGATGGAACGTTCCGTTTGCGTGCGTTGG 4500 

• • • • • 

CGCGACGAGCCCTCTCCTCGGCGTGATTCTGAAATTCTGCAATCAGGGCA 

• • • • • 

GCCGCAGCACGGCGACGGGACGTCCTCCAGGAGCTGTGTTGAAGTTTCGG 4 600 

• » • • ■ 

GGTGGCGGTCCAGAAGGGGGAGTTACATTAAAAGCCTCATAGATGTCTTT 

• • • • • 

GGGTGGTTCCGGGGGGCCCATCGCAAGATCTTCTGGAGTTGTGCGTCTGA 4700 

• • • ♦ • 

TCATCTCTTGAGTGTAATTGCGACGCAGACCGAGCTTCAGGATTTTGGAA 

• • • • • 

GGGCTGGATCGCTCCTGCTGACTCTTTCCCTCAGCGGGCTTCGTCTCGGC 4800 

• • • • • 

AGTCTTCATTTCGGCGGGCTGATCTTCCATCTCAGAATGGGATCGCTTTC 

• • • • • 

TGGTCGCTGCACCCGCTCCTCCCTTCAAGGTCAGCTTGATGCGCAGCGTC 4900 

• • • • • 

TTGGGCGGCTCAGCTGGTGGAGTTGGTTCCGGCTCTGGCTCCCTCCGGCG 

• • • • • 

TCGCTTGGGCACTTGAGTAGTCTCTGAGGCTTCGCCGCGGCGCCGTTTGC 5000 
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Figure 6 (sheet 6 of 7) 

• • • • • 

GAGTCGGCTCCTTGGTCTCTTTGGCCTCTTTCACTTCACCTGGACCGTCT 

• • • • • 

TTCGGGGCGGTTTCATCGTGCTGAGCGATCAAGGTTTGGATGTAGGCAGC 5100 

CGGCATCATTCGATCAACGGCAATTCCTCTCTTGCGGGCCTCCTCCCGAG 

• • • • • 
CCTTGATTGTCGCCTTGACCTCGTCCACGTTTTCGAAGAAGAAAGGCATC 5200 

• f • • • 

TTGTTATCCTGAGGCAAGTTGCGCTCTCCCATGCGTGGGGATATCCGAAG 

• • • • • 

ATGCGGTCCTTCTCGAACTGTTCATGAGACTTCAGACGAATTGGAGGCTG 5300 

« • • • • 

GGGGAGCAATTTGTCTCCGTAGGTGTTGTTAGGGCGGAACCAAGAATAGC 

CTTCGCCTACAACGACAAGCTCTTCGCCAAATTTATTTTTTTGGCCTGTA 5400 
AAAACGAACCCATCCTCGTCAGTCCACCGGTGCGTCTCGGACGTAGAGAT 

• • a • • 

TGGCTTACTTATTCCCTCAACGCCGATCTCTGCCTGGGGCTGCGCTTCGG 5500 

ATGCGGCCTCGGTCACGGCTCCGCCTCGGACTGCACCGCTGGAGTTTCGG 

• • • • • 

TCTTCTTCTCCTGCTTCTCCAGGTACTCCTTGCGTAACTCTTCGATCAGC 5600 

• • • • • 

CTCGGCTTCCGATGACTGCTCAAATTCTGGAGCAACAGCTGCCGCGGCCA 

• ■ ■ • • 

GGTCAAGCAGGCGGTTTGCTAAAACTGCCCATTTTCCATCGACACCTGCC 5700 

TCCGACGCCTGTGCAAAACCAGCTGTTTTCGCATTGGCCTGTTTGTTGGC 

• • • • • 

ACGCGTCTTCTTGACTGCTGCCTTGCCCTTTACTTCCTTGAGAGCAGACT 5800 

CTGGCTTAGATGATGGTGCACGGTTTCTGCGGAAGCGCCGCTCAGATTCC 

• • • • • 

AAAGATTCCATAGCTTTAATGGTAGGCTTTCTGGTTCTTCCAGAAGTGCG 5900 

• • • • • 

CGCAGCTGACGTAGTGGTTGAGTAGCTGGCAGTTGGGGATCCTGGGCCCT 

» • • • • 

CATTGGAACCATCAAGACCAAATTTGTTTCCATACATATCAGCATGGTAT 6000 
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Figure 6 (sheet 7 of 7) 

« • • • • 

TCAAAAGGAAAACTTTCGCCGTACGGAGTACTGCGTTCGATTCCGGGTGT 

. • • • • 

ATCCAAGTCGTATCCAGACATGGTGTCGAATTCAGCCTTGCTGTCAAGAG 6100 

. . • • • 

CAGGGGTACTTTCAATGCTGTCAGCAACCACGCGGCCAAAGGGCGTCTTC 

« • • • • 

GGGAAAGAAGGTGTTTCAAGAGAAGCGTCATCCACGGCCTGGCTTGCGGC 6200 

. • • • • 

GTTGATTGCAGACTTTCGAGTAGATCGCTGAGGTCGCGAACTGGTTCGAG 

. • • • • 

TAGCAACCTGTGAATTGGCAGCCTTGTGACTGCTTCGATTCACTGCAGAG 63 00 

• • • • • 

ACGGAGTAGACTGCACTGATTTGGAATTCTGAGTCGCAGCCATTCTGGAT 

. • • • • 

TTGCGTTCGGCGCGACGAGATCTCGCAGTCGTGGTACGAGGAGTAGAGCG 6400 

» • • • • 

AGGCTGCGTAGCAGTGTTGCAAGCTTGGTGCTAGCCTCCTGGGCTTCAGC 

• • « • • 

AGCTTCAGCAGTGGTGGCAGACGCAGCAGAATTAGCGGAGCTTTATCGGC 6500 

• • • • • 

TTTGCCGCTCTGAGCGTTGGGAGTAGAAGTGAGAGAAGAGGTAGAGTCCA 

• • « • • 

CGGAAGAAGTCTTCTCGCTGTTCTCAAAGCCGTTCAGCTTTGCTGGCATA 6600 

• • a • • 

GACTTACGCGTCTTGCGGCTGTTGGAAGCGGAAGAGTTCATGGCGGGAGA 

■ « • « • • 

GGAGACGTTAGAAGTAGACATGGTGGGGTTTGTTGACGGGTTTTGAGTAA 6700 

« • • • * 

CAAGAGACTTGCGTCGATCTTTGAGTGTTCTTGACAGAAAGTTATGCAAC 

GTCGAC 6756 
Sail 
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Figure 8 (sheet 1 of 2) 



ATGGGCGTCTCTGCTGTTCTACTTCCTTTGTATCTCCTGTCTGGAGTCAC 
MGVSAVLLPLYLLSGVT 
-23 -20 ' -10 

CTCCGGACTGGCAGTCCCCGCCTCGAGAAATCAATCCAGTTGCGATACGG 100 
SGLAVPASRNQSSCDT 
' -1+1 ' 10 

TCGATCAGGGGTATCAATGCTTCTCCGAGACTTCGCATCTTTGGGGTCAA 
VDQGYQCFSETSHLWGQ 
' 20 ' 

TACGCACCGTTCTTCTCTCTGGCAAACGAATCGGTCATCTCCCCTGAGGT 200 
YAPFFSLANESVISPEV 
30 ' 40 

GCCCGCCGGATGCAGAGTCACTTTCGCTCAGGTCCTCTCCCGTCATGGAG 
PAGCRVTFAQVLSRHG 
'50 '60 

CGCGGTATCCGACCGACTCCAAGGGCAAGAAATACTCCGCTCTCATTGAG 300 
ARYPTDSKGKKYSALIE 
/ 70 ' 

GAGATCCAGCAGAACGCGACCACCTTTGACGGAAAATATGCCTTCCTGAA 
EIQQNATTFDGRYAFLK 
80 ' 90 

GACATACAACTACAGCTTGGGTGCAGATGACCTGACTCCCTTCGGAGAAC 400 
TYNYSLGADDLTPFGE 
' 100 ' 110 

AGGAGCTAGTCAACTCCGGCATCAAGTTCTACCAGCGGTACGAATCGCTC 
QELVNSGIKFYQRYESL 
/ 120 ' 

ACAAGGAACATCGTTCCATTCATCCGATCCTCTGGCTCCAGCCGCGTGAT 500 
TRNIVPFIRSSGSSRVI 
130 ' 140 

CGCCTCCGGCAAGAAATTCATCGAGGGCTTCCAGAGCACCAAGCTGAAGG 
ASGKKFIEGFQSTKLK 
' 150 ' 160 

ATCCTCGTGCCCAGCCCGGCCAATCGTCGCCCAAGATCGACGTGGTCATT 600 
DPRAQPGQSSPKIDVVI 
' 170 ' 

TCCGAGGCCAGCTCATCCAACAACACTCTCGACCCAGGCACCTGCACTGT 
SEASSSNNTLDPGTCTV 
180 ' 190 

CTTCGAAGACAGCGAATTGGCCGATACCGTCGAAGCCAATTTCACCGCCA 700 
FEDSELADTVEANFTA 
' 200 ' 210 
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Figure 8 (sheet 2 of 2) 

CGTTCGTCCCCTCCATTCGTCAACGTCTGGAGAACGACCTGTCCGGTGTG 
TFVPSIRQRLENDLSGV 
' 220 ' 

ACTCTCACAGACACAGAAGTGACCTACCTCATGGACATGTGCTCCTTCGA 800 
TLTDTEVTYLMDMCSFD 
230 ' 240 

CACCATCTCCACCAGCACCGTCGACACCAAGCTGTCCCCCTTCTGTGACC 
TISTSTVDTKLSPPCD 
/ 250 ' 260 

TGTTCACCCATGACGAATGG ATCAACTACGACTACCTCCAGTCCTTGAAA 900 
LFTHDEWINYDYLQSLK 
/ 270 ' 

AAGTATTACGGCCATGGTGCAGGTAACCCGCTCGGCCCGACCCAGGGCGT 
KYYGHGAGNPLGPTQGV 
280 ' 290 

CGGCTACGCTAACGAGCTCATCGCCCGTCTGACCCACTCGCCTGTCCACG 1000 
GYANELIARLTHSPVH 
' 300 ' 310 

ATGACACCAGTTCCAACCACACTTTGGACTCGAGCCCGGCTACCTTTCCG 
DDTSSNHTLDSSPATFP 
' 320 ' 

CTCAACTCTACTCTCTACGCGGACTTTTCGCATGACAACGGCATCATCTC 1100 
LNSTLYADFSHDNGIIS 
330 ' 340 

CATTCTCTTTGCTTTAGGTCTGTACAACGGCACTAAGCCGCTATCTACCA 
ILFALGLYNGTKPLST 
' 350 ' 360 

CGACCGTGGAGAATATCACCCAGACAGATGGATTCTCGTCTGCTTGGACG 1200 
TTVBNITQTDGFSSAWT 
/ 370 ' 

GTTCCGTTTGCTTCGCGTTTGTACGTCGAGATGATGCAGTGTCAGGCGGA 
VPPASRLYVEMMQCQAE 
380 ' 390 

GCAGGAGCCGCTGGTCCGTGTCTTGGTTAATGATCGCGTTGTCCCGCTGC 1300 
QEPLVRVLVNDRVVPL 
/ 400 ' 410 

ATGGGTGTCCGGTTGATGCTTTGGGGAGATGTACCCGGGATAGCTTTGTG 
HGCPVDAL6RCTRDSFV 
' 420 ' 

AGGGGGTTGAGCTTTGCTAGATCTGGGGGTGATTGGGCGGAGTGTTTTGC 1400 
RGLSFARSG6D WAECFA 
430 ' 440 

TTAG 1404 
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